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REMARKS 

Claims 1-35 are cancelled. Claims 36, 38-40, 42, 43, 46-50, 52, 53, and 56-58 are 
amended. Claims 60-73 are withdrawn from consideration by the examiner. 

Claims 36-57 and 59 stand rejected under 35 USC 103(a) as being impatentable over 
Green et al (US 7,547,817) in view of Qui et al (Intl. Immunol. 1999; 11: 37-46). 

Claim 58 stands rejected under 35 USC 103(c) as being unpatentable over Green et al in 
view of Qui et al and further in view of GenBank AC073553 (September 2002). 

Claims 36 and 52 stand rejected under 35 USC 103(a) as being unpatentable over Green 
et al in view of Qui et al and further in view of Harriman et al. 

Each of these rejections are addressed and traversed in the following remarks. 

Background of the of the invention 

The IgH locus structure and the mechanism of class switch recombination are well- 
known in the art (see Immunology, Ivan Roitt et al, Third Edition, 1993, Mosby, pages 
5,9 to 5.11 and 1L9 to 11.12; Annex I). B cell maturation requires the expression of 
membrane IgM on B cells surface. The production of a selected antibody isotype 
requires the development of IgM producing B cells which subsequently undergo isotype 
class switching from IgM to the IgG or IgA, or IgE isoptypes. Isotype class switch is 
mediated through a deletional recombination event (class switch recombination or CSR) 
occurring between tandem directly repetitive switch regions (S) present 5' of all IgH 
constant region genes except C5. Transcription starting from an I exon located 
immediately upstream of the switch region and continuing through the switch region is 
required for CSR. Enhancers and cytokine response sequences are known to lie in the 
region near the I promoter. 

Brief description of the claimed invention 



6 



Application No. 1 0/577,061 



Attorney Docket No.40521U 
Customer No.: 50438 



The claimed invention relates to a transgenic knock-in mammal, v^erein the 
endogenous immimoglobulin heavy chain locus (IgH) has been modified by 
homologous recombination to replace the switch sequence with a transgene 
containing a himian heavy chain constant region gene Ca or a segment of the Ca gene 
containing at least an exon encoding the CH3 domain and a membrane exon. 

Figure 1 of the present specification illustrates the endogenous IgH locus in mice, the 
targeting vector which can be used for homologous recombination and the murine locus 
after homologous recombination. 

The deletion of the switch sequence Sjx in the endogenous IgH locus and its replacement 
with the human Ca transgene produces the following effects on the immunoglogulin 
heavy chain genes expression: 

(i) a chimeric human IgA heavy chain in which the constant region is human and 
the variable region is from the non-human mammal is expressed to high level; 
this chimeric IgA heavy chain benefits from a completely diversified repertoire 
corresponding to the normal repertoire genemted by rearrangements of the VH, 
D and JH segments at the non-human mammal IgH locus, 

(ii) the expression of the endogenous Cfi gene responsible for the synthesis of IgM 
heavy chains is abolished, and 

(iii) the expression of the other immunoglobulin heavy chains genes which code for 
the other classes of immunoglobulins (IgG and IgE) is greatly reduced due to 
the blocking of the Ig class switch toward the immunoglobulin constant genes 
located downstream of C^ on the endogenous IgH locus. 

These specific characteristics of the non-human transgenic mammal of the claimed 
invention are demonstrated in the examples. For example, Example 1 demonstrates that 
the non-human transgenic mammal of the claimed invention produces antibodies which 
contain large quantities of chimeric human IgAs (in the gram per litre range in mice) but 
no IgM. Example 3 demonstrates that the transgenic mammal is capable of producing 
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antibodies with high afiSnity as a secondary response to the antigen since its B 
lymphocytes can recruit the somatic hypermutation phenomenon. 

The brief description of the claimed invention above is intended solely for purposes of 
clarification and is not intended to limit either the scope of any pending claims. 

• Green g/g/, (US 7,547,817) 

Green describes himian antibodies producing transgenic mammals (Xenomouse™) 
which are obtained by introducing yeast artificial chromosome cloning vectors 
containing large germline ftagments (unrearranged) of the human Ig locus (human Ig 
YAC transgene), into the somatic and germline cells of a mammal (column 2, line 3 1 to 
column 3, line 8 and column 4, lines 4-14). 

Green describes that current technologies for obtaining a transgenic mouse which 
produces an antibody of the desired isotype requires antibody re-engineering in vitro 
which is labor intensive, slow and expensive (column 4, line 60 to colunm 5, line 36). 

Green is thus directed to solving the specific problem of obtaining a pre-selected human 
antibody isotype fi-om a transgenic mouse (column 5, lines 53-55), 

Green describes that due to the differential responsiveness of moxise and human switch 
regions to lymphokines and other activators it is desirable to have heterologous switch 
regions controlling CSR in human antibodv producing mice (column 12, lines 29-33). 

Green discloses a transgenic mammal containing in its somatic and germline cells an 
unrearranged human inununoglobulin heavy chain (IgH) YAC transgene containing VH 
genes, all the D elements, all J elements, Sji, Cn and C5 fi-om human chromosome 14, 
and a chimeric himian IgH constant region gene in which human constant region gene 
sequences encoding the desired heavy chain isotype are operablv linked to an 
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heterologous or non-cognate switch regioa (column 5, line 66 to column 6, line 3; 
column 6, lines 21 to 36). For example, a mouse switch region is operably linked to a 
human gamma, alpha or epsilon constant region coding segment, or a human switch 
region is operably linked to a human constant region segment, the switch region being 
from.a different isotype than the constant region coding segment (column 6, lines 40-45 
and 51-55).The heterologous switch region controls switching firom the human IgM to 
the downstream human Ig of the desired isotype. The transgenic mammal of Green is 
engineered to produce human IgM and human Ig of the desired isotype (column 5, line 
66 to column 6, line 3). 

A first transgenic mouse line producing human IgM and IgG2 antibodies (Xenomouse) 
was previously derived by Green, ifrom the yHlC transgene composed of 66 VH genes, 
all the D elements, all J elements, C\i and C8 and all regulatory elements in germline 
configuration, appended in 3' with a 22 kb fiagment containing the himian Cy2 gene, 
including its switch region (Sy2), and a 4 kb fiagment containing the mouse 3' enhancer 
element (column 9, line 58 to column 10, Ime 42). 

The human IgH YAC transgenes disclosed in Green are engineered by introducing a 
targeting vector having 5' and 3' flanking homology to yHlC and an appropriate 
selection marker, into yeast carrying yHlC. Such vector can be recombined in vivo in 
yeast to replace the human Sy2 and Cy2 with the mouse Syl fimctionally linked to a 
human CH coding sequence {e.g. human or the human Cyl) or the human Cy2 by 
another human CH gene, in yeast carrying yHlC (column 12, Ime 36 to column 13, line 
9 and figures 1 to 7). To produce an antibody of the desired isotype, new transgenic 
Xenomouse are then generated by introducing the recombinants IgH YAC transgenes 
into mouse ES cells. 

Harriman et al. J. Clin. Invest.. 1996. 97. 477-485 and Oiu et aL Int. Immunol, 1999. 
11.37-46 
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These two research papers from Harriman are directed to understanding the process of 
antibody class switching and their implication in IgA deficiency (abstract and 
introduction of Harriman et aL). For this purpose, an la knock-out mouse was generated 
by targeted deletion (Figure 1 of Harriman et al) and used in Qiu et al (figure 1 and 
page 38, 2"^ column, beginning of second paragraph which refers to Harriman). In this 
la knock-out mouse, a himian HPRT mini-gene driven by the PGK promoter and having 
a SV40 poly(A) signal replaces the la exon and proximal promoter located upstream of 
the switch region Sa, in the endogenous IgH constant region gene Ca. The knock-out 
mouse is generated by targeted deletion. Therefore, all the sequences 5' and 3' to the la 
exon which include Sji and C|ui in 5' and Sa in 3% are present in the knock-out mouse 
IgH locus. Harriman et al teaches that the la knock-out mouse produces mouse IgM. 
IgG and IgA at levels equivalent to those of the wild-typ e (page 480, 2"^ column, 2*^^ 
paragraph and p^^e 481, 1^^ column, end of first paragraph). Harriman et al teaches 
also that the la exon or transcripts containing the la exon are not required for IgA class 
switch but a second signal is required for the induction of IgA class switch (abstract). 
Qui et al discloses that a transcript of any sequence which is spliced across the switch 
region is necessary and maybe sufficient for CSR. 

' GenBankAC073553.5 

GenBank AC073553.5 discloses the sequence of a 187523 bp DNA segment fi-om 
mouse chromosome 12. 

Non-obviousness of claims Sd-S?, 59 over Green et al in vie^^^ of Qiu et al 

The rejection of claims 36-57 and 59 under 35 USC 103(a) over Green et al (US 
7,547,817) m view of Qiu et al (Int. Immunol., 1999, 11, 37-46) is respectfully 
traversed as the cited references do not disclose all the elements of the claimed 
invention, suggest the combination of elements of the claimed invention, or provide a 
reasonable expectation of success for the claimed invention. 
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Green was cited as differing from the present claims in that although Green may provide 
transgenic mice for producing any desired isotypes of human antibodies including IgA, 
the exemplified target IgH gene is not Ca (page 6 of the present OfSce Action). 

However this is respectfiilly imtrue. It is worth noting that the Examiner herself 
acknowledges that it is not true since it is noted on page 8 of the present Office Action 
that "/Ae combined teaching of Green in view of Qiu teaches a transgenic mouse with a 
non-cognative S region in its genome but not lacking a S}x '\ 

Green differs fix)m the present claims in that Green, at the very least, does not disclose 
or suggest any of the following features from the claimed invention: 

1. Green does not disclose a knock-in mammal generated by homologous 
recombination at the mammal endogenous IgH locus but a transgenic mammal 
(Xenomouse™) generated by random insertion of large germline fragments of the 
human IgH locus, into the somatic and germline cells of a mouse using YAC 
vectors (Column 5, Line 66 to Column 6, Line 3, Column 6, Lines 36-37;Colimm 7, 
lines 44-48; Column 9, Line 58 to Column 10, Line 42). 

Therefore, contrary to what is stated by the Examiner on page 6 of the present 
Office Action, Green does not disclose a targeting vector comprising 5' and 3' 
mouse flanking sequences for homologous recombination, which is introduced into 
mouse ES cells. 

2, Green does not disclose targeted DNA sequence replacement at the mouse 
endogenous IgH locus. Green discloses targeted DNA sequence replacement in 
yeast, on a human IgH YAC transgene comprising unrearranged variable region 
genes (V, D, J genes) Sji, C^i, C5, Sy2, Cy2, and a 4 kb fragment containing the 
mouse 3' enhancer element (column 12, line 36 to column 13, line 9, cited by the 
Examiner on page 5 of the present Office action, and figures 1 to 7). 
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3. Green does not disclose targeted DNA sequence replacement, wherein the 
endogenous switch sequence S\i is replaced with a transgene construct comprising a 
human heavy chain constant region gene Ca or a segment of the Ca gene 
comprising at least an exon encoding the CHS domain and a membrane exon. 

Green discloses targeted DNA sequence replacement, wherein: (i) the human Sy2 
switch sequence and the hiraian Cy2 coding sequence are replaced with the mouse 
Syl switch sequence functionally linked to a human CH coding sequence (e.g. 
human or the human Cyl), or (ii) the human Cy2 coding sequence is replaced by 
another human CH gene (column 6, lines 40-45 and 51-55; column 12, line 36 to 
column 13, line 9 (cited by the Examiner on page 5 of the present Office action) and 
figures 1 to 7). 

Therefore, contrary to what is stated by the Examiner on page 5 of the Office 
action. Green does not disclose the deletion of the mouse since Green discloses 
targeted replacement within a human inmiunoglobulin heavy chain (IgH) transgene. 
In addition, the targeted replacement disclosed by Green does not contain the 
replacement of Sji but of Cy2 alone or together with its switch region Sy2. 

4. Green does not disclose a human heavy chain constant region (CH) transgene which 
is not Imked to a switch region. Green discloses a human CH gene which is always 
functionally linked to an heterologous switch region (column 6, Lines 29 to 32 and 
colunm 12 cited by the Examiner on page 5 of the present Office Action). 

5. Green does not disclose the insertion of a human CH transgene between the intronic 
enhancer E|i and the C^ gene (figure 1 of the present application) but downstream 
of the C\i gene (figures 1 to 4 of Green). 

6. Green does not disclose a modified IgH locus that is incapable of isotype switching 
from IgM to the isotype of the downstream human CH transgene. Green discloses a 
human IgH transgene which is always functionally linked to an heterologous switch 
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region and capable of isotype switching from the human IgM to the isotype of the 
downstream human CH gene (column 6, lines 26 to 29 and column 12 cited by the 
Examiner on page 5 of the present Office Action). 

7. Green does not disclose a transgenic mammal which produces no IgM and high 
level of chimeric human IgAs in which the heavy chains comprise a himian 
immunoglobulin A constant region and a variable region from the non-human 
mammal. Green discloses a transgenic mammal which produces human IgM and 
human Ig of the desired isotype. 

Qiu was cited as disclosing, on figure 1, a transgenic mouse whose endogenous switch 
region and Ca region was replaced with a human Sa and Ca (page 6 of the present 
Office Action). However this is respectfully untrue. As mentioned above in paragraph 
3.2.3, Qiu discloses an la knock-out mouse . In this mouse, the endogenous switch 
region and Ca region are not modified and there is no human Sa and Ca insertion 
(figure 1 of Qiu or Harriman which both refer to the same mouse). 

One of ordinary skill in the art would have no reason to combine Green with Qiu 
because Qiu does not disclose or suggest any element of the claimed invention which is 
missing from Green. Qiu discloses an la knock-out mouse that produces mouse IgM, 
IgG and IgA at levels comparable to the wild-type, as discussed above. 

Even, if the ordinary skilled artisan were to combine the disclosure of Green with that of 
Qiu, the ordinary artisan would modify the mouse described by Green by linking the 
heterologous switch region functionally to an HPRT minigene as taught by Qiu to arrive 
at a transgenic mammal containing an unrearranged human inraiunoglobulin heavy 
chain (IgH) transgene containing VH genes, all the D elements, all J elements, Sji, C|i. 
and C8 from human chromosome 14, and a chhneric human IgH constant region gene in 
which human constant region gene sequences encoding the desired heavy chain isotype 
are operably linked to an heterologous switch region which is operably linked to an 



13 



Application No. 10/577,061 



Attorney Docket No.40521U 
Customer No.: 50438 



HPRT minigene, ie. a transgenic mamma! which is even more different from the 
claimed transgenic mammal than the transgenic mammal of Green. 

Furthermore, the prior art and the common knowledge at the time of filing of the present 
application (Annex I) , in fact, teach away from the present invention. It is known in the 
art that B cell maturation, in v/vo, requires the expression of membrane IgM on B cells 
surface and that the production of a selected antibody isotype, in vivo, requires the 
development of IgM producing B cells which subsequently undergo isotype class 
switching from IgM to the IgG or IgA, or IgE isoptypes (column 10, lines 43 to 60 of 
Green and Annex I). As discussed above, the mice which are disclosed in Green, Qiu 
and Harriman (which refer to the same knock-out mouse as Qiu) have IgM producing B 
cells which subsequently undergo isotype class switching from IgM to the IgG or IgA, 
or IgE isoptype. Therefore, to produce human Ig of the desired isotype (IgG, IgA,) one 
of ordinary skill in the art would never have engineered mice which do not produce IgM 
but only Ig of another isotype (IgG, IgA). In accordance with the claimed invention, the 
present inventors have demonstrated that, in the absence of IgM expression, manmdals 
can surprisingly develop frmctional B cells able to generate an immime response. 

Therefore, even the combined disclosures of the cited references would fail to disclose 
all the elements of the claimed invention, or suggest this combination or provide a 
reasonable expectation of success for the present invention because numerous elements 
from the claimed invention are missing from these reference disclosures. In fact, these 
references actually teach away from the present invention. 

Hence, this ground of rejection is unsustainable and should be withdrawn. 

Non-obviousness of claim 58 over Green et al in view of Qiu et al and GenBank 
AC073553.5 
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Claim 58 stands rejected under 35 USC 103(a) as being impatentable over Green et al 
(US 7,547,817), in view of Qiu et al (Int. Immunol., 1999, 11, 37-46) as applied to 
claims 36-57, 59, and further in view of GenBank AC073553.5. 

This rejection is not sustainable over the combination of Green and Qiu for the reasons 
discussed above. However, GenBankAC073553.5 does not remedy the deficiencies of 
Green and Qui. 

GenBank AC073553.5 discloses the sequence of a 187523 bp DNA segment from 
mouse chromosome 12. 

However, GenBank AC073553.5 does not disclose the specific fragments 
corresponding to SEQ ID NO: 7 and SEQ ID NO: 8 of the invention, nor the 
combination of said specific fragments with a human Ca gene or a fragment of said 
gene comprising the CH3 domain and membrane exons. Furthermore, there is no 
suggestion of such a combination, nor does this reference provide a reasonable 
expectation of success the same because the conventional knowledge at the time of 
filing of the present application actually teaches away from the claimed invention for 
the reasons discussed above. 

Accordingly, this rejection is unsustainable and should be withdrawn. 

Non-obviousness of claims 36 and 52 over Green et al. in view of Qiu et al and 
Harriman et al 

Claims 36 and 52 stand rejected under 35 USC 103(a) as being unpatentable over Green 
et al (US 7,547,817), in view of Qiu et al (Int. Immunol, 1999, 11, 37-46) as applied 
to claims 36-57, 59, and fiirther in view of Harriman et al (J. Clin. Invest., 1996, 97, 
477-485). 
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Harriman was cited as establishing that it was known in the art that the S^i region is not 
required for IgA class switch, and that a mouse without the genomic Sji region still can 
produce IgA antibody. 

However this is not true for the reasons discussed above. Harriman and Qiu disclose the 
same la knock-out mouse. In this knock-out mouse, generated by targeted deletion, the 
only region of the endogenous IgH locus which is deleted is the la exon (figure 1 of 
Harriman). Therefore, all the sequences 5' and 3' to the la exon which include S(j, and 
Sy in 5' and Sa in 3' are present in the la knock-out mouse IgH locus. Harriman teaches 
that the la knock-out mouse produces IgM, IgG and IgA and that the IgA are produced 
by class switching, i.e., by recombination between the Sp. and Sa, or Sy and Sa regions. 

Therefore, Harriman does not disclose or suggest that the Sp. region is not required for 
IgA class switch, and that a mouse without the genomic S|x region still can produce IgA 
antibody. 

Importantly, Harriman fails to disclose all of the elements missing from the two primary 
references, or suggest this combination or provide any reasonable expectation of success 
for the present invention. In fact, the combined disclosures of the cited references would 
teach away fi'om the claimed invention for the reasons noted above. 

Hence, this ground of rejection is unsustainable and should be withdrawn. 

Claims 36-5 1 and 59 stand rejected under 35 USC 112, first paragraph. 

However, the rejection of claims 36*51 and 59 under 35 USC 112, first paragraph, as 
failing to comply with the written description requirement is respectfiilly traversed as 
the IgH locus sequences required for practicing the claimed non-human manmial are 
described in the specification in such a way as to reasonably convey to one skilled in the 
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relevant art that the inventors, at the time the application was filed, had possession of 
the claimed invention. 

The practicing of the claimed invention does not require all the sequences of the IgH 
locus but only a very small portion of sequences (less than 10 kb) flanking the Sfj. 
region, the targeted site for homologous recombination (see figure 2). 

These sequences which are necessary and sufficient for practicing the claimed invention 
are defined by specific features (size, well-known protein domains, i.e., JH/Ep. region, 
C\i gene) which are disclosed in the specification (see figure 2). 

The specification provides also the positions of these sequences on mouse chromosome 
12 sequence (Genbank/EMBL AC073553). 

At the date of filing of the present application, the IgH gene locus had been mapped in 
different mammal species and its entire sequence or part of its sequence comprising the 
JH/E|j, region and Cfi gene sequences were available for numerous species, including at 
least human, mouse, rat, sheep, cattle, dog, cat, rabbit, hamster, shrew and pig 
(Schrenzel et aL, Immunogenetics, 1997, 45, page 386, 2^^ column, end of last 
paragraph; Annex II). Examples of GenBank accession numbers are: AYl 58087 
(bovine), M13800.1 (rat) and X02804.1 (hamster), shown in Annexes III to V. 

Furthermore, using the mouse sequences that are described in the application, one 
skilled in the art would have been able to obtain the corresponding sequences fi"om other 
mammals, using standard molecular biology and/or sequence analysis techniques which 
were routine in the art at the date of filing of the present application. 

Therefore, this ground of rejection is clearly moot. 

Claims 36-51 and 59 stand rejected under 35 USC 112, first paragraph. 
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However, the rejection of claims 36-51 and 59 under 35 USC 112, first paragraph, as 
failing to comply with the enablement requirement is respectfully traversed as the 
specification does enable any person skilled in the art to which it pertains, or with it is 
most nearly connected, to practice the invention commensurate in scope with these 
claims. 

First of all the essential materials (IgH sequences) required for practicing the claimed 
invention are adequately described by the instant disclosure for the reasons mentioned 
above. 

The specification describes how the claimed transgenic mammal can be obtained by 
homologous recombination with an appropriate targeting vector (see page 11, lines 4- 
11, page 11, line 31 to page 12, line 14) and provides a working example of the 
invention (transgenic mouse line of example 1). 

The state of the art and the level of skill in the art is such that pronuclear microinjection 
of fertilized eggs and the use of in vitro embryo production in combination with gene 
transfer technology are available for the mammal species for which ES cells were not 
available (Mullins et al^ J. Clin. Investigation, 1996, 97, 1557-1560; in particular page 
1557, second column, end of paragraph entitied '^Transgenesis by pronuclear 
injection''). 

The inefficiency of pronuclear microinjection due to random integration of the 
transgene is not a relevant problem for the present invention since the transgene is 
integrated by homologous recombination in a functional locus. Mullins et al mentions 
clearly that targeted homologous recombination improves the inefficiency of pronuclear 
microinjection (see page 1558, first column, middle of first paragraph: "/w many cases, 
simply including large amounts of flanking sequences may be sufficient to overcome 
position effects and direct expression to specific tissues''). Therefore, pronuclear 
microinjection is not an unpredictable technique in the case of the present invention 
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since it allows efficient integration of the transgene at the nuunmal IgH locus and 
subsequent expression of this transgene. 

Furthermore, Mullins et al point out that the use of in vitro embryo production in 
combination with gene transfer technology is a major improvement to pronuclear 
microinjection of fertilized eggs because transgene screening and cloning can take place 
before reintroduction into the natural host. MulUns et al. mentions that the microinjected 
embryo technology allows efficient production of transgenic cattle, rabbit and sheep. 

Therefore, one skilled in the art with the teaching of the specification and the 
knowledge in the art would be able to practice the claimed transgenic mammal without 
undue experimentation. 

TTierefore, this ground of rejection is deemed to be moot. 

Claims 36-51 stand rejected under 35 USC 1 12, second paragraph. 

However, the rejection of claims 36-51 are believed to be obviated by the amendments 
which were made to claims 36, 46 and 48. 



Therefore, withdrawal of the rejection is respectfully requested. 
The specification stands objected to. 

In view of above amendment to the specification, this ground of rejection is moot. 

Finally, Applicants emphasize that even assuming for the sake of argument that one 
skilled in the art would have had motivation to combine the cited references of record, 
the artisan still would not have been put in possession of tiie claimed invention. 
Moreover, KSR v. Teleflex, 550 U.S. 398 (2007), is of no moment to the patentability 
of the claimed invention since in the present case one skilled in the art could not have 
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relied upon ''common sense"' to find implicit motivation to combine the cited references 
for at least three reasons. 

First, even the combined reference disclosures lack the full measure of claimed genetic 
elements. That is, the combined teachings fall short of the claimed invention. 

Second, the combined reference disclosures actually teach away from the claimed 
invention. Unlike in KSR, where the U.S. Supreme Court found nothing in the prior art 
that would teach away from the Asano reference (used to deteraiine the obviousness of 
the claimed invention), on the present record there is cited prior art that teaches away 
from the claimed invention. 

Third, the claimed invention does not constitute a known problem for which there is a 
known solution. In fact, to the contrary, the cited references strongly indicate that the 
presently claimed 'solution' was anything but known, and, thus patentable. 

Guidelines for U.S. Patent Examiners are set forth at MPEP 2143, Eighth Edition 
(Revision 6, September, 2007). Notably, "Exemplary Rationales" A-G are set forth 
where an examiner might justifiably cite the KSR decision in support of an obviousness 
rejection, hi fact, none of these rationales apply to the claims at issue. This is striking 
masmuch as the KSR test for determining obviousness is clearly broader than the 
previously (and solely) used 'teach, suggest, motivate' (TSM) test. Yet, even an 
attempted application of the KSR test indicates patentability for the claimed invention 
based upon the prior art of record. 

Annexes I-V are attached to this Amendment m general support of the patentability of 
the claimed invention. As such, they have not been cited in an Information Disclosure 
Statement in accordance with standard practice. 
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Accordingly, in view of all of the above amendments and related remarks, it is believed 
this application is now in condition for allowance. Favorable consideration and early 
notice to this effect are eamestly solicited. 



Respectfully submitted. 




William E. Beaumont 
Reg. No. 30,996 
Juneau Partners, PLLC 
Customer No.: 50438 



* See Annexes I-V attached to this amendment 
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THIRD EDITION 

Ivan Roitt • Jonathan Brostoff • David Male 




Birds 

In contrast lo sharks, chickens possess extremely iimii- 
ed numbers of genes coding for immunoglobulins. In 
ihe light chain system there is only one V, one J and 
one C segment gene. The heavy chains are similarly 
resiriacd with single V and J segments. Altiiough there 
are about 15 Dh segments, these are all very similar in 
sequence and add HiUe to diversity. Despite this severe 
limitation cliickens are perfectly able to mount a wide 
range of antibody responses and produce sequentially 
diverse antibody tnoiecules. 

Upstream of the Vl gene is a region containing 25 
sequences similar to Vl regions^ but each lacking a 
leader exon and a promoter region. They also lack the 
characteristic heptamer-spacer-nonamer sequences 
needed for V-J rearrangement. These pseudogenes are 
not wasted but are used in a process of gene conver- 
sion» with sections of the pseudogene being inserted 
into the viable Vi, region. Hiis is a continuous process 
wiiich carries on after the B cells have left the bursa 
and multiple conversion events can occur during the 
lifetime of the B ceil. 



Mutations In the DNA of two Vh TIS genes 




Rg. 5.16 DNA of two anti-phosphorylcholine antibodies 
^: the T15 idiotype. (Black lines Jndicate positions. where 
•i;> genome has mutated from the flerm line seqUiEinice.) 
;i M^-^ '^^9* numbers of mutations In the introns ?ind 
^ ^i*^^®*^^ °' ^^^^ genes, but particularly in the second 
M region, HV2. By comparison, no mutations 

1^?^®^*^*^^® 96*^^5 coding for the constant regions. . 



Five mechanisms for the generation 
of antibody diversity 



^?v, , ^ • multipift qerm »nft V q aha* 


j>„- ■ — — — 

^ 2. V-J and V-O-J recombinations 




_^ 3. recombinational inaccuracies 




4. somatic point mutation 




5. assorted heavy and light chains 



C^^ince each mechanism can occur with any of the 
' I? potential for increased diversity multiplies et 
P|!>f immunoglobulin production. 



Similar processes occur with the heavy-chain gene 
loQiS. wliere up to a 100 Vh pseudogenes act to increase 
the diversity by similar conversion mechanisms. 

Rabbits 

Rabbit immunoglobulins iiave always presented a puz- 
zle, particularly in the way in which allotypes are regu- 
lated. Although the rabbit lias many Vh genes, the Vh 
gene nearest to the D segment is used in most rabbit B 
cells. Recent evidence suggests that the rabbit may also 
use a gene conversion mechanism to diversify this sin- 
gle Vh gene. 

PSEUDOGENES IN HUMAN DIVERSIFICATION 

Several V and J segment genes are also in the form of 
pseudogenes. Whether gene conversion is involved in 
generating human V regions is a matter of speculation 
and interest. 



HEAVY CHAIN CONSTANT REGION GENES 



All classes of immunoglobulin use the same set of vari- 
able region genes. When the class is charged, by a B 
cell iliat has matured into an antibody*fbrming cell and 
is therefore committed to a particular antigen, all that is 
switched is the constant region of the heavy chain. Tliis 
is also shown by the analysis of double myelomas, 
where two monoclonal antibodies are present in the 
serum at the same time. IgM and IgG antibodies from a 
patient with multiple myeloma have been found to 
have identical light chains and Vh regions; only the 
constant regions were switched from \i to 7. Often IgM 
and IgD are found on tlie lymphocyte surface at the 
same time. Capping these i^eceptors with antigen has 
revealed that the IgM and IgD from the same B cell 
have the same specificity for antigen, indicating similar- 
ity of Vh regions on the two classes (Fig, 5,18), 

All the constant region genes are arranged down- 
stream from the J segment genes. In the mouse there is 
one gene for each of the fji, € and ft isotypes and one 7 
gene for each of the four different IgG isotypes (Fig. 
5. 19). In man the constant region genes are more com- 
plicated, and it appears that one section of this region 
has undergone gene duplication and diversification. In 
man the oixlering of genes is jx, 8, [73, 7I. €l al), 7, 
I72, 74, €, a2]. The two sets witiiin square brackets 
indicate the possible area of reduplication. The genes 
€l and 7 are pseudogenes and are not expn*ssed. Just 
upstream (50 to the \l genes is a switch sequence (S) 
which IS repeated upstream (50 to each of the other 
constant region genes (Fig. 5.20). Class switching is 
important in the maturation of the immune response 
and may be accompanied or preceded by somatic 
mutation. Initially a complete section of DNA. including 
the recombined Vh region through the 8 and \l constant 
regions, is transcribed; then by differential splicing, two 
mRNA molecules are produced, each with the same Vh 
but having either p. or 6 constant regions. It is suggested 
that sometimes much larger stretches of DNA are also 
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capping of IgM and tgO with antigen 




fi9. S.18 Some B cells have both \gM and laD on .h. 
sur^ce (1,. This can be demonstrated biutaZ \,T 
-.th rhodaminated ami-iglW (red) and f lores ^JJiV'"' 
99 SgT ""^ a„«S t"''- 
?oT9Cre dl^^^^^^^^ 

c<»l/ 19\ « *k ""/-"'y separate red and green caps on 

appear together on the cell, that is. they co can Thic 
imphes that the IgM and IgO on the cell su,f^«c 
'inked by antigen t4,. This^could only o cur .h.^L"""- 
90 had the same antigen-binding sp'e 'n, t 



Rg- 5.19 The constant region genes of 
me mouse are arranged 6.5 kb 
downstream from the the recombined 

rl?l ^^^^ ^ (except 

(.«) has one or more switching 
sequences at its start (red circies) 

which correspond to a sequence at the 
5 end of the „ gene. This allows any 
01 the C genes to recombine with 
V-O-J. 6 genes appear to use the same 
s witchmg seq uences as ,1 but the ii 
gene iranscripi is lost in RNA 
processing to produce IgD. The C 
genes (expanded below for ^ and y29i 



Constant region genes in the mouse 
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also have a separate exon codU fo" Z'ZZV'T 

8 >or membrane-bound immunoglobulin (M). 
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Membrane and Secreted Immunoglobin 



Isotype switching by differential RNA splicing 
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Fig. 5.21 Single 8 cells produce more than one antibody 
isotype from a single long primary RNA transcript. A tran- 
script containing n and 6 is shown here. Polyadenylation 
can occur at different sites (black), leading to different 
forms of splicing, producing mRNA for IgD (top> or IgM 
(bottom). Even within this region there are additional 
polyadenylation sites which determine whether membrane 
immunoglobulin or secreted immunoglobulin is formed. 



Lost genes 



chromosome A 
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fig. 5.22 A and 8 are chromatids of the chromosome 
section of the Ig genes. A contains the rearranged V-D-J 
segn^ent. in the looping out hypothesis, a section of C 
Se^es 6. v3) loop out and are lost. In the chromosome 
e>«change hypothesis, similarities in the switching 
sequences permit unequal somatic recombination between 

aiernal and paternal chromosomes. The A chromosome 
^ecombines with another part of the unrearranged B 
loM T^?""^' '^^^ ®^ 9®"® segments gives rise to the 
^Tl ^^^'"'^ ^' "^^^ ''0^^' C Senes are on 

Which ' chromosome B' <not shown), 

now contains two copies of several C genes. 



trnnscribecl together, with differential splicing* ^ivin^? 
other imnuinOf»lobulin cbsscs sharii\i4 Vii regions (Tig. 
5.21). This lias been observed in cells simulianeously 
pKXiiicinK IgM and IgH. More often, class swiicliing 
appears to be mediated by a recombination i:)eivveen S 
recombination sites, allowing a K-x>pini; out and dele- 
tion of DNA and bringing anoiher C reigion dose lo the 
VDJ gene CFig. 5.22). A further |x>ssibiliiy has been sug- 
gested inx'olving exchange Ixnween chrtimosonies. 



m$ MEMBRANE AND SECRETED 
IMMUNOGLOBULIN 

Membrane imaiunogldntlin (antigen receptor) Is identi- 
tral to secreted immunoglobulin (antibody), e.NcejM for a 
.stretch of amino acids at the C terminus of each )iea\y 
chain. Membrane immunogiobulins are larger than their 



Membrane and secreted fgM: mouse 
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' bridge 



Fig 5,23 C-terminal amino acid sequences are shown for 
both secreted and membrane-bound IgM, and are identical 
up to residue 556. Secreted IgM has 20 further residues. 
Residue 563 (asparagine) has a carbohydrate unit attached 
to it while residue 57S is a cysteine involved in the 
formation of interchain disulphide bonds. Membrane IgM 
has 41 residues beyond 556. A stretch of 26 residues 
between 568 and 595 contains hydrophobic amino acids 
sandwiched between sequences containing charged 
residues. This hydrophobic portion may traverse the cell 
membrane as two turns of a helix. A short* positively- 
charged section lies inside the cytoplasm. 
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Such interactions induce ihe production of the cyto- 
kines IL-1. IL-3. and GM-<:SF which are required 
for T-cell maiuration in the thymus. Thymocytes also 
express receptors for lL-2 and this cytokine. Together 
with other molecules, promotes cell proliferation which 
mainly occurs in the subcapsular and deep conex, 

Neflative selection in the periphery (peripherai tolerancej 

Not all self-reactive T cells are eliminated during 
iniraihymic development, probably due to the inability 
of all self-antigens to transit through the thymic tissues. 
The thymic epithelial barrier may also limit access to 
some circulatory antigens. Given the survival of some 
self-reacting T cells, a separate mechanism is required 
to prevent their attacking the body. Recent experiments 
with transgenic mice liave suggested that peripheral 
inactivation of self-rcaaive T cells can have two causes: 

1. down-regulation of TCR and CDS (in cytotoxic cells) 
so that the cells are unable lo inieraa with target 
auioantigens 

2. anergy. due to the lack of crucial secondary activa- 
tion signals provided by the target cells. 

Peripheral tolerance is discussed in more detail in 
; Chapter 10. 

^4 Extrathymic T cell development 

^ Although the vast majority of T cells require a function- 
ing thymus for tlieir differentiation, small numbers of 
W cells (often oligoclonal in nature) carrying T cell mark- 
have been found in aihymic CnudeO mice. The pos- 
^|a1>ility that there are thymic remnants cannot, however, 
Ibe niled out. There is some experimental evidence sug- 
gesting that bone marrow precursors can home to 
Jnucosal epithelia and then mature to funaional TCR-1 
"^T cells, without the need for a thymus. 



J CELLS 

In the chicken, primary B-cell lymphopoiesis occurs in 
|: discrete lympho^pithelial organ, the bursa of 
libricius. The bui-sal nidiment develops as an outpush- 
' _ of the hindgut endodemi and becomes seeded with 
lood-borne stem cells. Studies on chicken/quail 
leras have indicated that there is a window for the 
ngraiion of stem cells into the bursa between days 
[0 and 14 of embryonic life (see Chapter 14). 

jninophylic cells - the putative stem cells - are seen 
J/coniaa with epithelial cells. Bursal ceil proliferation 
^es rise to the cortex and the medulla in each bursal 
||icle, which may be seeded by one or a few stem 
fs(Fig. 11.12). 

|Mammals do not have a specific discrete organ for 
"^11 lymphopoiesis. Instead, these cells develop 
|ctly from lymphoid stem cells in the haemopoietic 
% of the fetal liver (Fig, 11.13) from &-9 weeks of 
[ation in humans, and by about 14 days in the 
jse. Later the site of B-cell production moves from 
pver to the bone marrow, where it is continued into 
% life. This is also true of the other haemopoietic 
pes, giving rise to erythrocytes, granulocytes, 
"locytes and platelets. Recent data have indicated 



that B-cell progenitors are also present in the omental 
tissue of murine and human fetu.ses. Whether or not 
these B-cell progenitors precede those in the fetal liver 
remains to be established. 




bursal folds 



Fig. 11.12 Section of a bursa showing B cells developing 
in foHide$. Like the fetal liver, the bursa is a site of spme- 
granuloqytppoiesis, not just a site for. 
lymphocyte development. H&E stain, x50. 




hepstoeytes 




islands of 
r^^S^:^^^ haemopoietic 



Fig. 11.13 Section of human fetal Ihrer showing islands of 
haemopoiesis. Haemopoietic stem cells (HSCs) give rise to 
islands of differentiating lineage-specific cells, including 8 
cells. 



B-celJ production in the bone marrow does not occur 
in distinct domains. However, it has been shown that 
B-cell progenitors are adjacent to the endosteum of the 
bone lamellae. Each B-cell progenitor, at the stage of 
immunoglobulin gene rearrangement, may produce as 
many as 64 progeny. These migrate towards the centre 
of each cavity of the spongy lx)ne and reach the lumen 
of a venous sinusoid. In the bone marrow, B cells 
mature in close association with stromal leticuiar cells 
The latter are found Ixith adjacent to the endosteum 
and in close association with the central sinus, where 
they are termed adventitial reticular cells (Fig. ll.H). 
Reticular cells liave mixed phenoiypic features with 
some similarities to fibroblasts, endothelial cells and 
smooth muscle cells. They produce type IV collagen, 
lammin and the smooth-muscle form of aciin. 
Experiments in vitro have shown that reticular cells sus- 
tain B-cell differeniiaiion. Adventitial reiiailar cells may 
Ixr important for the release of mature B cells into the 
central simts. The majority of B cells (over 75%) maior- 
mg in the tone marrow do not reach the circuJation 



but (as in the thymus) undergo a process of nro. 
grammed cell death or apoptosis. and are phagocv ^- 
losed by bone nnarrow macrophages, h has bee ^ 
suggested that B-ceil-stromal interactions may mediate 
a form of positive selection that rescues a minority of B 
cells with productive rearrangement of their 
immunoglobulin genes from programmed cell death 
Negative seleaion of autoreactive B cells may occur in 
the bone marrow or in the spleen, the site to which the 
majority of newly produced B cells are exported. 

From kinetic daui, it is estimated that about 5 x lo? 
murine B cells are produced per day. Since the mouse 
spleen contains approximately 7.5 X 10^ B cells, a laige 
proportion of B cells must die. probably at the pre^B- 
cell stage stage where they outnumber the B cells in 
the marrow by a factor of two. 

The characneristic markers of the B-ceil lineage are 
immunoglobulins, which act as cell^surface antigen 
receptors. Lymphoid stem cells (probably expressing 
terminal deoxynucleotidyl transferase, Tdt) proliferate 
and differentiate and then undeigo immunoglobulin 





Fig. 11.14 Left: Low power scanning electron micrograph 
showing the architecture of bone and its relationship to bone 
marrow. A cavity has been picked out and is drawn 
schematically on the right. (Courtesy of Ors A. Stevens and J. 
Lowe). Right: Within the cavities of spongy bone, 6 cell 
lymphopoiesis takes place with maturation occurring in a 
radial direction towards the centre (from the endosteum to 
the central venous sinus). Immature progenitor cells 



adjacent to the endosteal cell layer mature into pre-B cells, 
most of which die and are phagocytosed by bone marrow ' 
macrophages containing tingible bodies (stained by 
haematoxylin). Cells which survive mature further and 
reach the central venous sinus. Association with reticular 
cells, and the presence of cytokines such as IL^7 is essential 
for all steps of B cell maturation. (Adapted from Osmond 0, 
Gallagher ft. tmmunoi Today 199 1:12; 1-3,). 
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gene rearrangemenis (see Chapier 4). Following this, 
they emerge as pre-B cells which express jx heavy 
chains in the cyroplasni. Ai least some of the pre-B cells 
exjxess small amouni.s of surface ^4, chains associaiecl 
with pseudo lighi cliains VpreB and \5. Allelic exclu- 
sion of either maternal or paternal unmunoglobulin 
genes has already occurred by this lime. The proliferat- 
ing pfe-B cells are thought to give rise lo smaller pre-B 
cells. On synthesis of light chains, which may \x either 
of K or \ type, but not both, the B cell is then commit- 
ted to the antigen- binding specificity of its sIgM ami- 
gen receptor. Thus one 8 cell can make only one 
specific antibody, a central tenet of the clonal selection 
tlieory for antibody production. A summary of B-cell 
differentiation, with expression of immunoglobulins 
and other relevant molecules, is shown in Fig. 11.15. 

A sequence of immunoglobulin gene reanxmgements 
and phenoiypic changes takes place during B-cell 
ontogeny similar to that described for T cells (see 
above and Chapter 5). Heavy chain gene rearrange- 
ments occur in B-cell jjragenitors and repi-eseni the earli- 



Lymphoid Cells 



est indication of B-lineage commitment. This is fol- 
lowed by light chain gene rearrangements which occur 
ai later pre-B-cell stages. Certain B-ceil surface markers 
are expressed prior to immunoglobulin detection, 
namely dass U MHC moleciiles. CD39. CD20, CD21 and 
the CDIO (CAUA) antigen. The latter marker is a highly 
conserved neutral endopepiidase which is transiently 
expressed on early B progenitors Ixfore the appear- 
ance of heavy ^ chains in the q^toplasm. CALLA is re- 
expressed later in the B-cell life history, following 
aaivation by antigen (see Fig. 11.15). Other markers 
such as CD23 and CD25 (lL-2 receptor a) are mostly 
foimd on activated B cells. 

A number of growth and differentiation factoi-s are 
required to drive the B cells through early stages of 
development, Receptoici for these faaors are expressed 
at various stages of B cell differentiation. IL-7. IL-3 and 
low molecular weight B-cell growth factor (L-BCGF) 
are important in initiating the process of B-cell differen- 
tiation whereas other factors are active in the later 
stages (see Fig. 11.16). 



B ceil differentiation 




MS B cells differentiate from lymphoid stem celts 
|v«rflin B ceils and may then be driven by antigen to 
|rne memory cells or plasma cells. The cellular location 
|munogiobulin is shown in yellow. The genes coding 
pi*>ody are rearranged in the course of progenitor cell 
|opment. Pre-B cells express cytoplasmic p chains 
^ immature 8 cell has surface IgM, and the mature 
]»8r immunoglobulin isotypes. On antigen 
^.ation the B cell proliferates and develops into a 



plasma cell or a memory cell following a phase of 
proliferation, activation and blast transformation. Memory 
cells and plasma cells are found at different sites in 
lymphoid tissue. Tdt is expressed very early in ontogeny. 
The diagram also shows the sequence of appearance of 
other important B cell surface markers. PCA-1 is found only 
on plasma cells. Note that C036 is an example of a 
molecule found on early progenitors that is lost, only to 
reappear on the fully differentiated plasma cells. 



Following their production in the fetal liver. B cells 
migrate and funaion in the secondary lymphoid tissue 
Eariy immigrants into fetal lymph nodes (17 weeks in 
man) are slglVl* and cany a T-cell marker (CDS). CD5+ 
B cell precursors are found in the fetal omentum Small 
numbers of CD5* B cells are also found in the mamle 
zone of secondary follicles in adult lymph nodes. 

Following antigenic stimulation, mature B cells can 
develop into memory cells or antibody-forming cells 
(AFCs). Surface immunoglobulin (sig) is usually lost by 
the plasma cell (the terminally differentiated form of an 
AFC), smce its function as a receptor is finished 
Immature and mature B cells respond in different ways 
to antigens. Treaunent with anti-lgM antibodies or anU- 
gen r«ults in loss of sIgM by capping and endocytosis 
m both mature and immature B cells. However only 
mature B cells nssynthesize sIgM in culture (Fig l l 17) 
Smce immature B cells can be induced to lose their 
antigen receptor, this could be one mechanism by 
which self-reaaive B cells are rendered toleram during 
developmem. * 



g|0«VERSITy OF ANTIBODY SPECIFICITY 

TTiere are tens of thousands of natural antigenic shapes 
Since one B cell can make only one antigen-specilic 
antibody, many B cells with different specific antibody 
receptors have to be generated from the stem cells. 

The genes encoding the variable regions of antibody 
molecules are described in detail in Chapter 5 The 
variable region genes comprising V, D and J segments 
are present in every somatic cell in a genn-line confieu- 
ration. DtJring early development inten-ening sequences 
between D and J are deleted, bringing these genes clos- 
er togeUier. Further rearrangements of the V. D and 1 
segments of the x'ariable region heavy chain genes (Vh) 



occur dunng the progenitor stage of B-ceU devel^ ? 
tnent (see Fig. n.15). The various combinatSmTr 
these genes (one productive rearrangement per cem'^ 
are expressed with p. heavy chain genes in the 
plasm of the large pre-B ceU. These actively prolife^T 
mg pre-B cells then rearrange their Vk genes. anS 
their V\ genes, if the k rearrangement has not be^^ 
successful. When a light chain gene is produaS ^ 
rearranged the immature B cell expresses surface I 
chains with either k or X light chains. Those cells n^ l 
making productive rean-angements probably die (ner ^ 
haps by apoptosis). This is one explanation as to whJ - 
so many pre-B cells die during development (see '! 
atx>ve). There is some evidence that pseudo-ljght-chain ' 
genes are expressed prior to k and X light chains, and • 
that these may assemble small amounts of surface leM 
on pre-B cells. This might be important in selection^ I 
eariy pre-B cells. ' 

Once die K or X light chains are being produced the 
surf^ace IgiM on the immature B cell can act as a func- 
tional antigen receptor. The rearrangements of the V D 
ajid J segments (heavy chains) and V and J segments 
(light chains) are thought to be randomly generated 
withm the B cells. However, there is evidence in mice 
rats and chickens for a programmed sequence of devel-' 
opment of specific antibody specifidties (Fig 11 ig) 
Antibody production, as distina hom antigen recogni- 
tion by B cells is. however, dependent on both T cells 
and APCs. 

The reason for this programmed development of 
specificities at the molecular level within B cells is 
unclear, but it may reflea the biased uUlization of V 
gene segments nearest to ihe .D or J segments, with a 3' 
directional movement of the relevant recombinases 
and/or some negative selection of particular clones 
(possibly for self-reactivi«es). Many of the fim B cdls 
to appear in ontogeny express a predominantly T cell- 
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Rg. 1 .16 The whole life history of B cells from stem cell 
0 mature plasma cell is regulated by cytokines present in 
their environment. Receptors for these cytokines are 



setectively expressed by B cells at different stages of 
development. IL-7 plays an Important role in inmatino 
events in B cell differentiation. 
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Abstract Horse {Equus caballus) immunoglobulin mu 
chain-encoding (IgM) variable, joining, and constant gene 
segments were cloned and characterized. Nucleotide se- 
quence analyses of 15 cDNA clones from a mesenteric 
lymph node library identified 7 unique variable gene seg- 
ments, 5 separate joining segments, and a single constant 
region* Based on comparison with human sequences, horse 
variable segments could be grouped into either family 1 of 
immunoglobulin (/g) clan I or femiiy 4 of /g clan II subclan 
IV. All horse sequences had a relatively conserved 16 base 
pair (bp) segment in fiameworic 3 which was recognized 
with liigh specificity in polymerase chain reaction by a 
degenerate oligonucleotide primer. Horse complementarity 
determining regions (CDR) had considerable variability in 
predicted amino acid content and length but also included 
the presence of relatively conserved residues and several 
canonical sequences that may be necessary in formation of 
the p chain main structure and conformation of antigen- 
binding sites through interaction with light chain CDR. 
Sequence analysis of joining regions revealed die presence 
of nearly invariant 3' regions similar to those found in 
human and mouse genes. A single horse IgM constant 
region comprising 1472 bp and encoding 451 residues 
was also identified. Direct comparison of the horse constant 
region predicted amino acid sequence with those from 
eleven other species revealed the presence of 53 invariant 
residues with particularly conserved sequences within the 
third and fourth exons. Phylogenetic analysis using a 
neighbor-joining algorithm showed closest similarity of 



The nucleotide sequence data reported in this paper have been 
submitted to GSDB and have been assigned the accession numbers 
L81 155 (clone Igl); LSI 156 (clone !g51); L8U57 (clone 114); L81 158 
(clone 116); L81I59 (clone 118); L8n60 (clone Igl9); L81161 
(clone Ig20), and L49414 (mu chain-encoding constant region) 
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the horse mu chain-encoding constant region gene to 
human and dog sequences. Together, these findings provide 
insights into the comparative biology of IgMos well as data 
for additional detailed studies of the horse immune system 
and investigation of immune-related diseases. 



Inununoglobulins (Ig) constitute the major functional pro- 
tein of himioral immunity and are comprised of heavy and 
light chains, each containing variable and constant genes 
(Honjo 1983; Chothia and Lesk 1987; T\itter and Riblet 
1989). Ig heavy chain variable domains are generated by 
the recombination of discontinuous germline variable (Igh- 
V)y diversity {Igh-D), md joining (Igh-J) segments. Within 
the Igh'V molecule, differences in peptide sequences are 
clustered in diree intervals, termed hypervariable or com- 
plementarity determining regions (CDR), which are sepa- 
rated by conserved framework regions (FW). The CDR 
comprise the antigen-binding portions of immunoglobulins, 
while the FW provide structural stability to allow proper 
association of light and heavy chain heterodimers (Kirkham 
et al. 1992). The heavy chain constant genes define the 
different Ig subclasses as well as aspects of their inmiuno- 
logic functions and have been named IgM, IgD, IgG, IgA, 
and IgE. Among these, the IgM molecule appears to be the 
most primitive, since it was the fust to appear in vertebrate 
evolution (phylogeny) and is the first Ig expressed during 
ontogray of the immune system (Bosch et al. 1992). 

In humans and mice, Igh-V, Igh-D^ Igh-J and Igh-M 
constant genes have been extensively characterized with 
similar but less detailed work reported for rats, sheep, 
cattie, dogs, cats, rabbits, hamsters, shrews, pigs, ducks, 
and several fish (Bosch et al. 1992; Meek et al. 1990; 
Parker et al. 1994; Vasquez et al. 1992; Hordvik et al. 1992; 
IshigUFo et al. 1989; McCimiber and Capra 1978; Bernstein 
et al. 1982; Patri and Nau 1992; McGuire et al. 1985). 
Based on comparative analyses firom these studies, mam- 
malian variable gene segments appear to have been derived 
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Fig. 1 Alignment of predicted 
amino acid sequence for seven 
cDNA clones encoding horse Igh- 
V, Igh-D and Igh^ genes. The 
ends of the variable gene segment 
and the diversity segment are es- 
timates based on lineiq)s with 
human and mouse germline Ig 
sequences. Sequence identity 
widi clone 114 are shown as 
dashes (-). Gaps (*) were in- 
serted to maximize alignment. 
Nucleotide sequences of cDNA 
clones have been submitted to 
GSDB (accession numbers 
L81155-L8n61) 
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from three distinct progenitor genes (Tutter and Riblet 
1989). Descendants from these genes are categorized into 
three dans based on nucleotide sequence similarity in FWl 
and FW3 regions (Kiikham et al. 1992). Each clan contains 
one or several families (>80% nucleic acid sequence 
identity places two genes within the same VH family 
(Cook and Tomlinson 1995). In humans, at least 95 Igh-V 
segments comprising 7 families and 6 Igh-J gene segments 
have been identified (Chothia and Lesk 1987; Cook and 
Tomlinson 1995). In mice, 13 Igh-V families and 4 Jgh-J 
segments are known (Meek et al. 1990). In horses, only the 
IgE heavy chain, including four /^/i-F sequences, have been 
characterized (Navarro et al. 1995, and unpublished data 
GSDB accession number: U17041). In fliis study, we 
characterize the nucleotide sequences of 7 unique horse 
Igh-Vgcnes^ 5 /gW segments, and the /gM constant region, 
including the secretory peptide, and compare our findings 
with those from other species. 



tific, Friendswood, TX) according to the manufacturer's instructions. 
Seven |ig of total RNA was used for cDNA synthesis with MMLV 
reverse transcriptase (Gibco BRL, Eggenstein, Germany) and random 
hexanucleotides as described (Schrenzel et al. 1994). PGR reactions 
contained 5 mM MgCh, 50 mM KCl, 10 mM Tris-HCl pH 8.0, 0.2 mM 
of each dNTP, 50 pmol of each primer, and 0.5 units Tag DNA 
polymerase (Promega, Madison, WI) in a final volume of 50 ^1. 
Primers used for detection of horse JgM gene expression were 
degenerate and based on alignments of mu chain-encoding ccmstant 
regions of other species. The sense primer was AAG CC( AC) G( AT)G 
GCC TGG (CT)TG TC and the anti-sense primer CTA G(AG)C TGC 
TGT GCT TCA CCC designed to generate a 290 bp PGR product PGR 
conditions were 95 for 45 s, 54 °G for 1 min, and 72 ""C for 1 min 
for 30 cycles. This PGR fragment was cloned into pGEM-T using the 
TA Cloning Kit from Promega (Madison, WI) according to the 
manufacturer's instructions. Primers used to amplify a broad spectrum 
of horse Ig genes were a degenerate oligonucleoti(te primer 
GGGG(GACXGTXGAG)TATT(AT)GTGT designed to bind FW3 
based on alignments of horse Igh-V genes and an anti-sense constant 
region primer GTGTTGTTGTGGTAGTTGGAG. PGR conditions were 
95 °G for 45 s, 54 for 1 min, 72 X for 1 min for 26 cycles. cDNA 
from normal horse lymph node and from a horse B<ell lymphosar- 
coma along widi plasmid-cloned horse Ig genes were used in the PGR. 




Animals 

Samples for RNA and DNA extractions were provided by the Equine 
Research Laboratory and Veterinary Medical Teaching Hospital at the 
University of Galifomia, Davis. RNA for PGR detection of heavy chain 
rearrangement and clonality expression was obtained from normal 
mesenteric lymph node from a healthy adult Thoroughbred horse and 
an adult Quarterhorse with lymphosarcoma. DNA for Southern blot 
hybridizations was obtained from kidney. 



Polymerase chain reaction (PCR) 

RNA was isolated from mesenteric lymph node from a healthy two- 
year-old Thoroughbred horse and from a lymphoid tumor from a six- 
year-old Quarterfac^ u^g the RNAzoI B method (TM CinnaScien- 



cDNA library construction and saeening 

Total RNA from mesenteric lymph nodes of five healthy unrelated 
horses (three Thoroughbreds, one Standardbred, one Quarterhorse) was 
isolated over cesium chloride gradient and used by Stratagene (La 
Jolla, GA) to constnict a size-fractionated lambdaZapII cDNA library 
with oligo-dT primers. The cDNA library was screened using a horse- 
specific 290 bp IgM constant region probe (described above) labeled 
with [alpha-P32]dGTP using a hexanucleotide random priming system 
(Boehringer Mannheim, Indianapolis, IN). Plaque hybridization was 
carried out for 18 h at 60 ^G in 5x stand^d sodium citrate (SSG), 1 x 
Denhardt's solution, 0.1% sodium dodecyl sulfate (SOS), and 100 \ig/ 
ml denatured herring sperm DNA. Filters woe washed twice with 2x 
SSG and once with Ix SSG at 60 X. Single cl(mes were obtained by 
secondary and tertiary screenings under tfie same conditions. Inserts 
were subcloned into flie Bluescript phagemid vector by in vivo 
excision with Ex Assist helper phi^e (Stratagene). 
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Gene manipulation and DNA sequencing 

Cloned PCR products and isolated cDNA library clones were se- 
quenced by the chain termination method (Sanger et al. 1977) with 
Sequenase 2.0 (US Biochemicals, Cleveland, OH) and with a Prism 
automated sequencing machine (Perldn Elmer, Norwalk, CT) with the 
M40 forward primer, Ml 3 reverse primer, and primers designed for 
regions of the horse IgM constant region gene. 



Sequence analyses 

Sequences were assembled and analyzed with the University of 
Wisconsin Genetics Group (GCG) computer program. A neigjibor- 
joining tree showing a hypothesis of the evolution of the /gAf constant 
region gene was produced by the Phylogenetic Analysis Using Parsi- 
mony (PAUP) program and the data were bootstFq)ped to determine 
levels of support (Saitou and Nei 1987; Swofford 1993). 



Fig. 2 PCR products obtained from amplification with sense degen- 
erate oligonucleotide primer for conserved Ig FW3 region and anti- 
sense primer to IgM constant region, stained with ethidium bromide, 
and run on 2% agarose gel; 100 bp ladder with double bright band 
representing 600 hp on left. Lanes 1-3 selected plasmids containing 
horse Ig genes; Lane 4 cDNA from normal horse lymph node; Lane 5 
horse B-cell lymphosarcoma. Note discrete bands for plasmid clones 
and monoclonal B^cell lymphosarcoma but different size PCR products 
from lymph node resulting in broad smear 

123 456 7 8910in2V+ 



Results 

Fifteen clones, including seven unique Igh-V segments, 
were isolated from a horse mesenteric lymph node cDNA 
library using a 290 bp PCR fragment probe to the IgM 
constant region. Five of the seven variable segments were 
full-length, and all clones contained FW3 (Fig. 1). Nucleo- 
tide sequence analyses for horse Igh-V regions showed 
closest sequence identity with other published horse Igh-V 
segments followed by sheep, cattle, and human genes (data 
not shown). Comparison of the horse Igh-V segments with 
human sequences revealed closest overall nucleotide simi- 
larity with either family 1 of Clan I or family 4 of Clan II 
when using the GCG gap comparison (data not shown). 
Comparison of the seven horse /gA-F genes with each other 
similarly demonstrated the presence of two families. How- 
ever, all horse Igh-V genes had relatively conserved FW3 
regions. The horse FW3 regions were similar in size to 
those found in other species and allowed design of a single 
degenerate primer for recognition of all seven /^/i-F genes. 
This primer was used in the polymerase chain reaction 
(PCR) with an anti-sense oligonucleotide primer to the 
constant region and amplified the sevra cloned horse Igh- 
^ genes as well as cDNA from a normal horse lymph node 
and a B-cell lymphosarcoma (Figs. 2, 3). FWl regions from 
the five ^IMength horse Igh-V genes were also highly 
conserved. 

Variability analysis of horse Igh-V genes demonstrated 
the presence of three CDR (Figs. 1, 4). CDRl (residues 
26-35) had a constant length of 10 amino acids except for 
two sequences (114 and 116) which had additional insertions 
of two residues. Among the horse Igh-V genes, 4 of 6 had 
unique CDRl sequences when compared with data com- 
piled from 77 human IGH-V gene sequences, but all had an 
invariant valine residue at the adjoining residue 24 position 
and 5 of 6 had a glycine at position 26. Horse CDR2 
(residues 50-65) had considerable variability but did con- 
tain a relatively conserved asparagine residue at position 
61. Horse CDR3 (residues 96-101) varied significantly in 
residue content and length with sequences containing from 




Fig. 3 PCR products obtained from amplification with sense degen- 
erate oligonucleotide primer for conserved Ig FW3 region and anti- 
sense primer to IgM constant region, labeled with P32-dCrP, and run 
on 6% polyaciylamide gel; Lanes 1 -7 and 10-12 horse IgM samples 
cloned into Bluescript plasmid; Lanes 8 and 9 horse Tcr beta chain* 
encoding and Tcr delta chain-encoding genes cloned into Bluescript 
plasmid; Lane inverted triangle (d) cDNA from normal horse lymph 
node; Lane plus (+) cDNA from horse B-cell lymphosarcoma. Note 
lack of band formation for Tcr genes and polyclonal lymph node cDNA 
but specific recognition of all plasmids containing horse Ig genes and 
monoclonal B-cell lymphosarcoma 

6-10 CDR3 amino acids. However, 5 of 6 sequences 
contained a leucine or phenylalanine near residue 100. 

Horse Igh-D regions were estimated based on compar- 
isons with germline Igh-V md Igh-J sequences from other 
species and contained from 6-10 amino acids with an 
average of 8.1 . Among the CDR, the regions formed within 
the Igh-Vy Igh-D, and Igh^ junctional region had the high- 
est level of variability. However, 6 of 7 horse Igh-D regions 
contained at least one glycine residue widiin die Igh-D 
junctional region and 5 of 7 contained a tyrosine residue 
(Fig. 1). 

Seven different horse Igh-J genes were identified. How- 
ever, three of these genes differed by only 1 or 2 bp and, 
thus, may represent alleles of the same gene or reflect 
differences in N region modifrcation at the 5' terminus of 
the same gene segment. Thus, our analyses appear to have 
identified five unique Igh-J segments (Fig. 1). The five 
separate horse Igh-J genes had from 82.5% to 94,0% 
nucleotide identity with each other and an average of 
90.3% identity. Predicted amino acid sequences for the 7 
horse Igh-J sequences demonstrated higjily conserved 
areas, particularly at the 3* terminus, although the 5' 
temninus in five of seven horse Igh-J genes contained a 
tyrosine residue. 
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Fig. 4 liability analysis of 
seven translated Igh-Vgene seg- 
ments. Variability was calculated 
as the number of diflferent amino 
acids occurring at a given posi- 
tion divided by the frequency of 
the most common amino acid at 
that position (Kabat 1987). Ap- 
proximations of the complemen- 
tarity determining regions (CDR) 
are shown above and were made 
based on comparison with human 
sequences (Chothia and Lesk 
1987; Chothia et al. 1992) 



CDR1 



CDR2 
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CDR3 




Residue number 



Among the 15 horse genes sequenced, all had the same 
IgM constant region consisting of a 1472 bp segment 
encoding for 451 amino acids with six potential JV-glyco- 
sylation sites (Fig. 5). Also present was a 58 bp segment 
coding for 19 amino acids that represented the IgM secre- 
tory peptide. Southern blot hybrictization of the horse IgM 
constant gene to genomic DNA revealed the presence of a 
single band for Bam HI and Eco RI restriction enzyme 
digests (data not shown). A lineup comparison with ten 
mammalian species, including horse, and two non-mam- 
malian species (duck and salmon) demonstrated the pres- 
ence of 53 invariant residues and 159 highly conserved 
(present in at least 9 of 1 2 species) amino acids throughout 
the length of the IgM constant region (Fig. 6). Nucleotide 
and amino acid sequence comparisons of the horse IgM 
constant region with other species showed closest similarity 
to human and least similarity to duck and salmon. Among 
the four mu chain-encoding exons, CH3 and CH4 were the 
most highly conserved between species. A phylogenic tree 
calculated using neighbor-joining analysis is shown in 
Figure 6. In addition, parsimony analysis revealed similar 
phylogenetic groupings represented by high bootstrap va- 
lues in the neighbor-joining tree and lack of clear resolution 
between other groups (data not shown). 



Ig architecture accommodates an abundance of structural 
diversity allowing binding of numerous antigens within the 
confines of a conserved framework. Conservatism among 



Ig molecules with different antigen affinities was explained 
by discoveries of somatic recombination and junctional 
diversification for most mammals and intefgenic conver- 
sion in birds and rabbits. These findings were based on 
landmark analyses of Ig nucleotide sequences in mice, 
humans, chickens, and rabbits (Honjo 1983; Tonegawa 
1983; Thompson 1992). Subsequently, Ig gene sequences 
have be«i reported for at least 18 species, inchiding fish 
and amphibians. In this report, we characterize seven horse 
IgM cDNA clones and compare the predicted amino acid 
structures of variable, diversity, joining, and constant region 
segments with those from other species to better understand 
the comparative biology of Ig, 

All seven of our horse Igh-V sequences had highest 
nucleotide identity with one of three previously character- 
ized horse Igh-V sequences, and four had greater than 80% 
identity. These findings indicate that certain /g/r-K families 
may be highly represented in horses. In humans, VH3 Ig 
gene segments from Clan HI are most highly represented in 
both the germline and peripheral repertoire and are consid- 
ered to be essential and conserved components of the Ig loci 
in a variety of manunalian and non-mammalian species 
(Cook and Tomlinson 1995; Tutter and Riblet 1989). The 
Clan QI Igh'V genes are also preferentially expressed 
during fetal life and may function in recognition of primi- 
tive, conserved antigens (Kiikham et al. 1992). When 
compared with other species, horse Igh-V genes had highest 
nucleotide similarity to sheep, cattle, and human genes and 
least similarity to mouse and rat sequences. These findings 
were not unexpected, since horses have shared habitats with 
other herbivores, and thus have been exposed to similar 
selection pressures that shape development of the periph- 
eral immunoglobulin repertoire. Additionally, aitiodactyls 
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and persidactyls probably shared a more recent common 
ancestor than carnivores, rodents, and primates (Novacek 
1992). The high similarity of horse /gA-K genes to sheep 
and cattle genes is similar to findings for horse Tcrd-V 
which were most closely related to sheep sequences 
(Schrenzel and Ferrick 1995). The similarity of some 
horse Igh-V with human genes may have resulted from 
common exposure to certain pathogens or other exogenous 
antigens but more likely represents an artifact stemming 
from the more numerous human Igh-V sequences in the 
genebank. 

Variability plot analyses of horse /gA-F predicted amino 
acid sequences demonstrated the presence of CDR and FW 
similar to those of other species (Cook and Tomlinson 
1995; Chothia et al. 1992; Chothia and Lesk 1987). A 
significant finding was the presence of an 18 bp conserved 
region within FW3 recognized with high specificity by a 
single degenerate oligonucleotide primer in all horse Igh-V 
clones, a normal lymph node, and a B-cell lymphosarcoma. 
These results are similar to those for human IGH-V for 
which a single universal variable gene segment primer has 
been identified. This primer is believed to recognize ap- 
proximately 80% of all hiunan IGH-V genes, including 
those in different Clans, and has proven to be a valuable 



Fig. 5 Complete nucleotide sequence horse IgM constant region gene. 
Predicted amino acid sequence for the translated protein and potential 
iV-glycosylation sites are shown 



diagnostic reagent for assessing B-cell clonality in lym- 
phoid lesions (Ben-Ezra 1992; Wan et al. 1990). Like the 
human and mouse, horse Igh-V sequences also had highly 
conserved residues within CDRl and CDR2 (Chothia et al. 
1992; Chothia and Lesk 1987; Cook and Tomlinson 1995). 
These conserved residues were a valme at position 24 and a 
glycine at position 26 for CDRl and an asparagine residue 
at position 61 for CDR2, These amino acids are believed to 
be important in maintaining the three-dimensional packing 
structure associated with formation of antigen-binding sites 
within the Ig molecules (Chothia and Lesk 1987). However, 
when horse /g/i-FCDRl sequences are compared with the 
human CDRl canonical sequences following positions 24 
and 26, they differ significantly. The human CDRl cano- 
nical sequences are believed to be primarily responsible for 
the reliable duplication of main-chain structures present in 
antigen-binding sites by influencing packing, hydrogen 
bonding, or the ability to assume unusual <p, ^, or oo 
conformations (Chothia and Lesk 1987). The finding of 
unique CDRl sequences in horse Igh-V suggest that a 
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Fig. 6 Alignment of horse IgM 
constant region predicted protein 
sequence with other species. 
Dashes (-) represent amino acid 
residues conserved with the horse 
sequence. Black circles indicate 
invariant residues across all spe- 
cies shown. Gaps (*) were in- 
serted to maximize alignment A 
repetitive peptide motif identified 
in the hamster gene is underlined. 
Percentage identities and simila- 
rities of horse JgM constant re- 
gion wiUi other species is shown 
at the bottom right. References 
with accession numbers are: 
human PC67301, Hartndranath 
et al. 1993); dog (McCumber and 
Capra 1978); rabbit (J00666. 
Bemstein et al. 1982); rat 
(X78895, Parker et al. 1994); pig 
(M92O50, Bosch et al. 1992); 
hamster and mouse (X02804 and 
X03690. McGuire et al. 1985); 
sheep (X59994, Patri and Nau 
1992); shrew (X13920, Ishiguro 
et al. 1989); duck (U27213, un- 
published), and salmon (S48652, 
Horkvik et al. 1992) 
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dilTerent structural motif may contribute to foimation of the 
antigen-binding area. Like the human and mouse CDR3, 
the horse CDR3 contained a phenylalanme or leucine in 
most sequences. Additionally, a glycine was present in six 
of seven horse Jgh-D regions, indicating the importance of 
this residue in the structural integrity of Ig across species. 



Based on sequence comparisons, at least five distinct 
horse Igh-J grae segments appear to exist. This is not 
unexpected, since humans have six joining gene segments 
and mice contain four. The presence of additional horse 
Igh'J segments, however, cannot be excluded without more 
extensive gene analyses. The presence of significant van- 
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Fig. 7 A Neighbor-joining phy- 
logram representing genetic dis> 
tance of published IgM constant 
region genes. The tree was drawn 
using sequenced or predicted 
amino acid residues such that 
lengths of the horizontal branches 
are proportional to the number of 
amino acid substitutions that oc- 
curred on those branches. For 
references see Figure 6. B The 
bootstrap tree represents 1000 
cycles of random character sam- 
pling with replacement (Felsen- 
stein 1985). The bootstrap values 
indicate the level of support for 
the numbered branches 



Neighbor Joining 



B 



BootstfBp 



I M< 

I — 



' RabMt - 



' Human 



Dog 



' Shrow 



Sheep 
-Wg 



ability in the 5' region of the horse Jgh-J genes indicated 
that, like mice and humans, junctional diversification oc- 
curs through modification of the joining segment as well as 
the diversity and variable genes. It is also likely that further 
modification the 5' horse Igh-J genes occurs due to somatic 
hypermutation as in other species (Chothia and Lesk 1987). 
Despite diversification of the 5' joining segment and its 
functional role as a constituent of the hypervariable region, 
5/7 horse Igh-J genes sequenced contained a tyrosine 
residue in this region. This is similar to findings for humans 
and mice which have a 5' tyrosine in 4/6 and 4/4 joining 
segments, respectively (Chothia and Lesk 1987). The sig- 
nificance of these observations is not certain, but it has been 
shown that tyrosine residues have a high normalized 
frequency of occurrence in p-sheets (Creighton 1984). 
Phenylalanine, another aromatic amino acid, likewise oc- 
curs frequently in p-sheets and is present in 5/6 human 
IGH'J and 3/4 mouse Igh-J in the 5' region (Creighton 
1984; Chothia and Lesk 1987). Thus, it appears that the 
presence of an aromatic amino acid in the 5' joining 
segment is important to the integrity of the Ig structural 
domain in this area, perhaps by contributing to P-sheet 
formation. 

A single horse IgM constant region was present and 
contained highly conserved cysteine and tryptophan resi- 
dues as seen in other species. Additionally, comparison of 
hotse /gA/ sequence with nine other mammals and two non- 
mammalian species revealed the presence of 53 invariant 
residues and 159 highly conserved amino acids. The high 
level of structural similarity within the constant region may 
be related to the constraints placed on the constant region 
for assembly of light chains and a duplicate heavy chain to 
form a functional protein. Conversely, the conservation of 
amino acids, especially within exon 4, may be a reflection 
of the primary role of IgM in activation of the complement 
system which is highly conserved even between divergent 
species, the role of IgM as an Fc-bound surface receptor, or 
the requirement for polymerization of IgM molecules 
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(McCimiber and Capra 1978). Among the four exons 
comprising the constant region, exon 4 by far contains 
the most invariant and conserved residues. All twelve 
species analyzed also contained a potential M-glycosylation 
site and a cysteine residue within the secretory peptide. 
These regions have been suggested to be critical to the 
formation of IgM multimers and, in accordance with 
previous reports, indicate that multimers of secreted IgM 
likely occur in birds and fish as in mammals. The horse, 
like other mammals, lacks other potential iV-glycosylation 
sites within exon 4. In contrast, many lower vertebrates 
contain one or several A/-glycosylation sites in the last 
coding exon. The significance of this potential for structural 
diversity is unclear but may be related to the role of IgM in 
complement activation or possibly multimer formation. An 
additional feature that was revealed after aligrmnent of all 
species was the presence of a repetitive 1 5 amino acid motif 
within exon 3 in the hamster gene. The presence of this 
repetitive sequence is similar to findings for the sheep and 
cattle TcTg'C genes which contain duplicated peptide motifs 
within the coimecting peptide (hinge region; Hein et al. 
1990; Takeuchi et al. 1992). The functional significance of 
diese repetitive sequences has never been determined. Un* 
like most other Ig isotypes and T-K^ell receptor proteins, IgM 
does not contain a hinge region (Kuby 1994). 

Comparison of the horse constant region amino acid 
sequence with other species using a phylogenetic analysis 
indicated that the horse /gM chain has closest evolutionary 
distance to humans and dogs. This result was somewhat 
unexpected, particularly since the horse Igh-V genes were 
most closely related to die artiodactyls (sheep and cattle). 
Interestingly, the horse Tcrb-C and Tcrd-C also have highest 
similarity to human and dog sequences (Schrenzel et al. 
1994; Schrenzel and Ferrick 1995). The significance of 
these findings is unclear. Perh^s, structural conservation of 
the T-cell receptor and Ig constant regions among horse, 
human, and dog indicates a relatively close evolutionary 
relationship between the immune systems of these species 
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on a molecular level. Analysis of other immune cell 
proteins, such as Thy-1 production by T-cells, also demon- 
strates striking similarities between horses and humans 
(M.D. Schrenzel, unpublished observation). Furthermore, 
a recent report comparing protamine sequences from a 
variety of mammals by a cladistic method similarly 
grouped horses more closely with himians than with aitio- 
dactyls (unfortunately, members of the Camivora, such as 
dogs, were not included in the analyses; Ishiguro et al. 
1989). This finding differs from most common interpreta- 
tions of eudierian phylogeny but parallels our results for 
IgM. However, additional analyses of horse inmiune cell 
genes and proteins are needed to reach substantial conclu> 
sions about the relationship between the immune systems of 
horses, dogs, and himians. 

In summary, we characterized seven horse Igh-V genes 
and identified a single JgM constant segment. Structurally, 
the horse genes had many similarities to Ig genes of other 
species but also differed in some ways including the 
presence of unreported sequences within canonical areas 
of the CDR. Most significantly, however, the horse Igh-V 
sequences contained a fairly conserved region within FW3 
that allowed PCR amplification of different families with a 
single degenerate oligonucleotide. This data along with 
sequence information for the joining and constant region 
segments should provide the basis for in depth analyses of 
equine inunune-related diseases. 
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GenBank: AY158087.1 



Bos taurus immunoglobulin heavy chain 
(IGH) gene, partial sequence 



• Features 

• Sequence 

LOCUS AY158087 8259 bp DNA linear MAM 10-SEP- 

2003 

DEFINITION Bos taurus immunoglobulin heavy chain (IGH) gene^ partial 
sequence . 

ACCESSION AY158087 

VERSION AY158087,1 GI: 34558505 

KEYWORDS 

SOURCE Bos taurus (cattle) 

ORGAN I SM Bos taurus 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; 

Euteleostomi; 

Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; 

Ruminantia; 

Pecora; Bovidae; Bovinae; Bos. 
REFERENCE 1 (bases 1 to 8259) 

AUTHORS Zhao,Y., Kacskovics, I . , Pan,Q., Liberies, D. A. , Geli.J., 
Davis, S.K. , 

Rabbani,H. and Hammarstrom, L. 
TITLE Artiodactyl IgD: the missing link 
JOURNAL J. Immunol. 169 (8), 4408-4416 (2002) 
PUBMED 12370374 
REFERENCE 2 (bases 1 to 8259) 

AUTHORS Zhao, Y . , Kacskovics, I . , Rabbani, H. and Hammarstrom, L. 
TITLE Physical mapping of the bovine immunoglobulin heavy chain 
constant 

region gene locus 
JOURNAL J. Biol. Chem. 278 (37), 35024-35032 (2003) 
PUBMED 12829708 
REFERENCE 3 (bases 1 to 8259) 

AUTHORS Zhao,Y. and Hammarstrom, L. 
TITLE Direct Submission 

JOURNAL Submitted ( 03-OCT-2002 ) Center for Biotechnology at Novum, 
Karolinska Institute, Huddinge, Stockholm S-141 57, Sweden 
FEATURES Location/Qualifiers 
source 1..8259 

/organism="Bos taurus" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9913" 
/ chr omo s ome ="21" 
/map="21q23-q24" 
gene <1..>8259 

/gene="IGH" 

/note=" immunoglobulin heavy chain" 
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misc signal 
misc signal 
misc signal 
J segment 
misc signal 
J segment 
misc feature 
misc feature 
C region 



210. .260 

/gene="IGH" 

/note="JH-PSl" 

390. .440 

/gene="IGH" 

/note="JH-PS2" 

737. .784 

/gene="IGH" 

/note="JH-PS3" 

1018. .1062 

/gene="IGH" 

/note="JHl" 

1401. .1448 

/gene="IGH" 

/note="JH-PS4" 

1925. .1975 

/gene="IGH" 

/note="JH2" 

2849.-2883 

/gene="IGH" 

/note="enhancer core region* 

4360. .7380 

/gene="IGH" 

/note="mu switch region" 
7821. .8141 
/gene-"IGH" 
/note="mu CHI" 



ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 



gggaggggca 
tcaaatccca 
ttcaggaaga 
ctcccagcac 
ccctgggcac 
gacttgggtg 
gaaaagaagc 
ccccggtcac 
ttccctgggg 
ggtcccgccc 
gacaggagct 
agtgagggca 
gactgtgtcc 
ctcaggtgag 
cccttggtct 
ctctctgccc 
aggacgccag 
gtcgatgcct 
ctctctcctc 
tctaaggggc 
gcaccgagga 
tctgggggcc 
gaggcacagg 
gcatcattgt 
tctgctcagc 
gagatccagg 
gcagagcaag 
gcgggtctgg 
ggctgctcga 
tggtctgcct 
caagtgactt 
cctcgagcct 
tgattactat 



ggagcaatgt 
gaccaggggc 
cagacttgca 
ggggccagcc 
cgtctcctca 
gacttgggat 
aacagaatgg 
cgtgtccttg 
tccgtgcagc 
ttggatctgc 
ggctgggcgg 
ggggagaccg 
ctcagcaatg 
acggctctct 
ggggcagacc 
tgttgaggtg 
gccttgggtt 
ggggccaagg 
actctctctc 
ctgggcagcc 
gcagaggctc 
tcagcaccct 
tgaggtggtc 
caccgtgtaa 
tgagccctca 
ggcattctgg 
acctcctcgc 
ctgcctgact 
ggctggactt 
ctggggacca 
tgaccgacgg 
tgccaggctc 
agtatatatg 



gacctcatgg 
ctgaggcggg 
gctcctgggg 
cactgtgact 
gctgaatctg 
gtgagtcagg 
aagccatgct 
gggagtgtct 
cccaccgtgc 
cggagcccct 
gaggaaacgt 
ggcctggcct 
cttttgactc 
gcccgctccg 
ctccgtggtc 
gcgcctctgc 
tttgcacagc 
actcctggtc 
agggttttgg 
cgggggtctg 
caggcaccct 
tgggctctgg 
cggagcctca 
caactggctc 
ccaccccact 
aggtcaagaa 
cccaaggaca 
tgagcaggac 
aggagtctgt 
gaatgggaca 
cttccctgtg 
ccgaggtttt 
tttgcggccg 



gaacaaagca 
caggtgcttg 
gctccctggg 
atgctgactt 
gtgtctagtg 
gccagcaagg 
gctgggacat 
cccttgcagc 
cctgagcaag 
ggagatgagc 
gggcagagcc 
ccctagggtt 
ctggggccag 
tcctgggccg 
cctgggggat 
ctgtggaaca 
ccctagcggg 
accgtctcct 
tgcactttgg 
aaacactgag 
ctcctcctgg 
actcccaagg 
gccggccacg 
aagcactggg 
tcactgcacc 
agggagctgg 
ccgcgatgtg 
caggggcttc 
ggttcacggt 
tagtgtctct 
gcgcctggag 
tgttgggcga 
agggatcgag 



gatcaggaga 
tggggtgggg 
gaccctgggt 
ccatctctgt 
tccaagtcag 
aggtagggtg 
ggatctctgg 
actgcctggg 
gtagggtcct 
cagctgaggc 
cgacggggca 
catgtcttgg 
cgcgccccgg 
gggaagatgt 
ctgcgtctga 
ctggctttgc 
gcccacggca 
caggtgagtc 
ggaaaatcga 
ggcccaaggg 
gctctgtagc 
tcccagctgt 
agctcttgcc 
gtcgggaagc 
cggggagacc 
ggagagggtt 
ggtacgaggt 
cgtcgctttc 
cagccggccc 
ggcagagtca 
atggcgtggg 
ggctggagat 
gtcaccgtct 



cagcagacac 
ggcaagtgtc 
ggtttccgtg 
ggccaggctg 
gggacagtgg 
tgtttttgtg 
ggccagcgca 
cctcaggcag 
ggtccaacgg 
ctgaggggag 
gggtctgttc 
gggcagccag 
tctccatctc 
ctccaggggc 
ggccgtttgc 
atgggggcag 
ctgtgactac 
ctcaacagcc 
gggtgtcggg 
caggcttaca 
cagggctttc 
gctgcctcat 
tggggtccca 
ctgggcactg 
cagggtgtca 
ctggtgacag 
ggctcctttg 
tggggcaggc 
aggcaggcag 
gggtcacaca 
ggcccaggcg 
atcaccactg 
cctcaggtaa 
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1981 gagcggccca tccagggcct ttgctttatc tcatttcgtg cgatttttct gagcatcact 
2041 gtccggtcct ctgatgtgtc cttgtcccct cctccccggg gggactcggc agactggcca 
2101 ggaggggacc agctgcccta tgcatttcag agtctctatt ttctgatgcc tttaaaaaat 
2161 cagaatttca ttggcattca gagggggctt gggcgggaag ggccaccagt gggggaggcc 
2221 caggcccccc ttggcagcag ggcagcttgg gatgcggtgg aaggcaactt gccacggtcc 
2281 tagcatctgc agaggagcat gtctggataa ttagggcctc aggaacgctg cccgcggtgg 
2341 gcacagagaa cgccctcctc gggtgaggtg tgttctgcac tagactgtgt ttaaaattct 
2401 ttattgggta ggaagagaat tgtctaggtg aggacggaca tgcagtgtcc cgatcgtggc 
2461 gagagagggg agcccgggga ggtgacgggc gctgggcttt gtgaggccag tctaagagag 
2521 aaaggccgtt cgccagagga ggtgtgcttg cgaacaccaa gacagggcat ctctgaagcg 
2581 attcctgata gtctgaaaaa ttgaaacttt aaaaagaaat gtttaaagta ttttaaattt 
2641 ttatcattta attaacaacc gcaaatcatg gctttggaga gttgagacag gtacaagttt 
2701 ggccgaaaag tactaactag gttccatcgg ccctcggccc caattcaggg ctgttttgag 
2761 aataataaat tcagcttatt tttttaatgt aattggtggt gccgagttag tcaagatggc 
2821 cacgggcagg actgaccacc tgcagcaggt ggcaggaagc tgagagtctg tttttggaag 
2881 caagaaaaca cagttggtaa atttatagct tctggttccc aaaaggtggt ttgcggctgg 
2941 ttttgcccag ccccacagaa ccgaaagtgt tcccctgagc ggagcaacac ctggctaatt 
3001 tgcatttcta aaacaaggcg cagatgctga ccgaaactgg aaggttcctc ttttaactat 
3061 ttgaatttac ttcagcttta gcttatcaac tgctcactta tattcatttt caaagtcgat 
3121 gtttaagaaa ttaatttgtc tcaggtgtat tttatcaatg caatcgtatg ctctttaatg 
3181 agttactcct tcaaggtagt taagctgaga ctgtgcgcac gccgtgtgca ctgagtgtgc 
3241 tcacaatcgt agctggccgg ctccctcctg gcgatgaagc agctttctca gtgcggctcc 
3301 ctcactgcag aatgttcagg gccgcctttc gggtacccat ctcaggacag acttgggtga 
3361 aacgggggca ttcgtcgtga gttactctgc tacactacta gctgggggat tgagaggatg 
3421 cccagtgggg tgggagtcag cggggcctgg gacggcccgc agctggggac gtgcgggcct 
3481 ggcctctgcc gccaggccag tgggcaggtc acgagttagc acagtgaagg gattcagagg 
3541 aatccaccca ggtcagagag cccacgggac atgccaggga ctaggcgccg agctgtgagg 
3601 ggcaggagcc ggtgcagcca gactcgctgc cccgggacgt ttgtatgagt tgtttttgtt 
3661 tctacttgaa agggaatctg tctttaccta ctgcctgcga tgccggggtg cttcacaccc 
3721 acagggacca cttcctgcgt ccacacttag gtgttcaatg cagctttggg gttcagatgg 
3781 tctctgtttg ctagagagta tcctctttct cagcagaact ttgtgataaa gacagaattg 
3841 ggagcaggtt cagataagat gcatttcccg tggtttttaa tgaggacatt ggccagagag 
3901 cagttagtca ccaggactct tttagactcg gggcaggatc ttaggcccac ggcatgtgcc 
3961 caggaggcca ggccccaggg acgtatgccc aggctctcct ctcccagaga agcagcgctg 
4021 cctgctcagc agagacaggg cctctgtctg agggtggcaa gcgtgtctca gcctgtgaga 
4081 tgcagtaggg aagctaaagt cgtcacgatg cttttcaccc agagtggagg gtggtggtga 
4141 ctgacggctg ggagaagggc cggctgggcc tggtgtgggg atggaggcat cgggcaccca 
4201 ggagaagcag gaagaggctg gaaggggtga gcccagctgg gcagcatccg cagaccaggg 
4261 gctttagccg aggccacggg tccgggatct gaccctttct cgtcaggacc actgggcttt 
4321 cacgtctgtg agggcagcct , gggccaccga cagcaggtga gctggtctgg gctgaactta 
4381 gctgagccag gttgggctgg gctgagctgc actgagctgc gctgggctga gctgggttga 
4441 gctgggctga gcagagctgg gcagagtggg tgagctgggc tgagctgagc taagctgaac 
4501 tggagtgggt gagctgggct gggctgggct gggctgggct gggttgagct gggctgggct 
4561 gagctgcact gagctgggct aagctgggtt gggctgggct tagtggggct gggcggagtg 
4621 ggtgagctgg gctgagctgg attaggctga gttgagcgga gctgggcaga gtgggtgagc 
4$81 tgggctgagc tgggttaggc tgagttgagc tgggctcagc tgggctgagc tgggtggagt 
4741 gggtgagctg ggctgaactg ggttaggctg agttgagcta ggctcagctg ggctgcgctg 
4801 agctgcactg agctgagctg ggctgggctg ggctgagcgg agctgggagg agtggatgag 
4861 ctgagctaag ctgggttagg ctgagttgag ctgggcttag ctgggctggg ctgagcggag 
4921 ctgggaggag tgggtgagct gagctaagct gggttaggct gagttgagct ggactcagct 
4981 gggctgcgct tagctgggct gggctgagca gggctgagca gagctgggcg gagtgggtga 
5041 gctgggctga gctgggttag gctgggctga gctgggttag gctgcgatga gctgggctga 
5101 acggagctgg gcggagtggg tgagctgagc tgggctgggc tgagctgggc agagtgggtg 
5161 agctgggctg agctgggttg ggctgagttg agataagctc agctcggctg agctgggcag 
5221 agtgggtgag ctggactgag ctgagctggg ctgggctgag ctgggcggag cagagctggg 
5281 cagagtgggt gagctgggct acactgggct gggctgagct gcactgagct gggctgggct 
5341 gagctgggct gagcggagct gggaggagtg ggtgagctgg gctaagctgg gttaggctga 
5401 gttgagctgg actcagctgg gctgcgctta gctgggctgg gctgagcaga gctgggcgga 
54 61 gtgggtgagc tgggctgagc tgggttaggc tgagctgggc tgagcggagc tgggcagagt 
5521 gggtgagctg ggttgggctg ggctgcgctg agctgggctg agcggagctg ggcagagtgg 
5581 gtgagctggg ctgagctggg ttaggctgag ctgagcggag ctgggcggag tgggtgagct 
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5641 gggttaggct gggctgggct gggctgggct 
5701 gctgggctgg gctgacctgg gctgagcaga 
5761 tgggctgggc tgagctgagc taagctgaac 
5821 agctgggctg agctgggttg agctgggctg 
5881 ctgggcagag tgggtgagct gggctgagct 
5941 gctgaatgga gtggctgagc tgagctgggt 
6001 agctgggtgg ggtgggtgag ctgagctgag 
6061 gggttgagct gggcagagtg ggtgagctgg 
6121 tgagctgagc tgaatggagt ggctgagctg 
6181 ctaggctgag ctgggttgag ctgggctgag 
6241 gggctgagct gggccaagtg tgtgggctgg 
6301 gctgagctga gctgggcttg gttgagctgg 
6361 gctgggcaga gtgggtgagc tgggccgggt 
6421 tggcctgagc tgggctgagt gggtgagctg 
6481 gtgagctggg ctgggctggg ctgagctggg 
6541 ctgagctggg cagagtgggt gagctgggct 
6601 gtgttgagct gggcagagtg ggtgagctag 
6661 gctgaactga gctgagttga gttgggctga 
6721 actgagctga gctgggttga gctgggctgg 
6781 gttgagctgg gcagagtggg tgagctaggc 
6841 gctgggctga gctgggttga gctgaggtga 
6901 gctgcgcaga ttgggtgagc tgggctgaac 
6961 gagtgggtga gctaggctga gctgggctga 
7021 gctgaactga gctgacccag gttgagctga 
7081 gctggcagat tgggtgagct aggctgagct 
7141 gcagagtggg tgagctaggc tgagctgggc 
7201 ggagtgggtg agctgggctg agctgagctg 
7261 tgagctgggc tgagagggtt gagctgagct 
7321 ggattgggct gatctggatt gaactgggct 
7381 gggctagcct gggcttaact ggggtaagct 
7441 aacagaaggg aacccactga gctagcttga 
7501 catttaacct ccactgaggc tggtgggtct 
7561 tctggtcaag agtgggctgg cctcagtgat 
7621 gtgtatcagt cttattgatg accccagacc 
7681 ggcagtgagg gtgggctggc agggctgagt 
7741 ggtggagctg cccacacccc cgaccagcag 
7801 cctctctcgg gtccccagaa ggtgaatcac 
7861 tgagctcgcc atccgatgag agcacggtgg 
7921 ccaattcagt cagcttctcc tggaagttca 
7981 ggaccttccc cgaagtcctg agggacggct 
8041 cctcctcaag cgcctttcaa gggccggatg 
8101 agggaggaaa gaccgtcggc accgtgaggg 
8161 gtggttgggt gcggggggag ggtccaggcc 
8221 cggaagtgct gtccccagtc gtgagtgtct 



gggcagagtg ggtgagctgg actgagctga 
gctgggcgga gtgggtgagc tgggctgggt 
tgggctgggt tgagctgggt ggagtgggtg 
agctgaatgg agtggctgag ctgggttgag 
aggtggagtg ggtgagctga gctgagctga 
tgagctgggc agagtgggtg agctgggctg 
ctgagctgaa tggagtggct gagctgagct 
gctgagctag gtggagtggg tgagctgagc 
agctgggttg agctgggcag agtggctgag 
ctgaatggag tagctgagct gggctgaggg 
gctgggctgg gcttagttgg gttgacctgg 
gctgagcttg gctgagcttt gctgggctga 
tgggttgagc tggcctgagc tgggttgagc 
ggctgggctg ggctgggctg ggctgagttg 
gtgagctggg ctgggctggg ttgagctggc 
gaactgagct gagctgggtt gagcaaggct 
gctgagctgg gctgagctgg actgagctcg 
gctggcctgg gctgggctga gctgggctaa 
gttgagctgg gctgggttga gctgggctgg 
tgggctgggc tgtgctggca gagtgggtga 
gctgggctga actgagctgc actgtgttga 
tgagctgagc tgggctgttt aaactgggca 
actgagctga gttgagttgg gctgagctgg 
gctgggttga gctgggcagg gctgggctgt 
gagtgggctg agctgggctg tttaagctgg 
tgaactgagc tgagctgggc tgagctgggc 
ggcagaactg ggcttagctg gctgagcagg 
gagctgaact gagctgggtt gagctggcct 
gaactgagct gagctgggct gagctgggct 
gggttaacca tggtgggctg agactgggtg 
cttgggcaaa ttgttgagat gctttgggta 
agatggactg agctgacccc actggactgt 
ctgattgtgc tgaggactgg gagcctgagt 
cccagggcag gccccaggag gctaatcgcg 
gtgccctccg ctccactgtc ccagagagaa 
gatgcctctc acccccctct tctcctgtgt 
acccgagagt cttccccctg gtgtcctgcg 
ccctgggctg cctggcccgg gacttcgtgc 
acaacagcac agtcagcagc gagagattct 
tgtggtcggc ctcctctcag gtggtcctgc 
actacctggt gtgcgaagtc cagcacccca 
tgatcgctac aagtgagtca ggtacgtccc 
ccgctaaccc cttgtccttc tctgcagagg 
ttgtcccgc 
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GenBank: M13800.1 

Rat Ig germline mu H-chain C-region gene, 
exon 2 



• Comment 

• Features 

• Sequence 



LOCUS RATIGCF3 659 bp DNA linear ROD 27-APR- 

1993 

DEFINITION Rat Ig germline mu H-chain C-region gene, exon 2. 

ACCESSION M13800 

VERSION M13800.1 GI:204709 

KEYWORDS C-region; germline; immunoglobulin heavy chain; immunoglobulin 
mu-chain . 

SEGMENT 3 of 3 

SOURCE Rattus norvegicus (Norway rat) 

ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; 
Sciurognathi; Muroidea; Muridae; Murinae; Rattus. 
REFERENCE 1 (bases 1 to 659) 

AUTHORS Bruggemann^M. , Free, J., Diamond, A., Howard, J., Cobbold,S. and 
Waldmann,H. 

TITLE Immunoglobulin heavy chain locus of the rat: striking homology 



to 

JOURNAL 
PUBMED 
COMMENT 
1, 



provided 

FEATURES 

source 



gene 
intron 

CDS 



mouse antibody genes 

Proc. Natl. Acad. Sci. U.S.A. 83 (16), 6075-6079 (1986) 

3016742 

Original source text: Rat (PVG) liver DNA, clones 22-3-1, 5-2- 
45-3-1, 31-1-1, 18-1-3, and 24-3-1. 

Computer- readable copy of the sequence in [1] was kindly 

by M.Brueggemann, 19-NOV-1986. 

Location/Qualifiers 
1. .659 

/organism="Rattus norvegicus" 
/mol_type=" genomic DNA" 
/db xref="taxon: 10116 " 
1. .659 
/gene="IgM" 
<1. .77 
/gene="IgM" 
/note="Jl-C intron" 
/number=l 
78. .416 
' /gene="IgM" 
/note="C-region" 



Annex IV 



/nuinber=2 
/codon_start=3 

/product^" immunoglobulin mu-chain" 
/protein id=" AAA41372 . 1 " 
/db_xref ="GI : 554448" 

/translation="WEMNPNVSVFIPPRDAFSGPAPRKSRLICEATNFSPKQITVSW 

LQDGKPVKSGFTTEPVTVEAKGSRPQTYKVISTLTITESDWLNLNVFTCRVDHRGLTC 

LKNVSSTCAA" 
intron 417..>659 

/gene="IgM" 
/number=2 

ORIGIN 275 bp upstream of BamHI site; about 7 kb after segment 2. 

1 ggcccaggca tggcccagag ggagcagcga gtgggtctta agccagcctg agctcacacc 
61 tcaacctttc attccagctg tcgttgagat gaaccccaat gtgagtgtgt tcattccacc 
121 acgtgatgcc ttctctggcc ctgcaccccg caagtccaga ctcatctgcg aggccaccaa 
181 cttcagtccc aaacagatca cagtatcctg gctacaggat gggaagcctg tgaaatctgg 
241 cttcaccaca gagccagtga ctgtcgaggc caaaggatcc agaccccaaa cctacaaggt 
301 cataagcaca ctgaccatca ctgaaagcga ctggctgaac ctgaatgtgt tcacctgccg 
361 cgtggatcac aggggtctca cctgcttgaa gaacgtgtcc tccacatgcg ctgccagtga 
421 gtagcctgtg ctaagcccaa tgcctagccc tcccacatta gagcagtcct cctacggttg 
481 tggccaatgc cacccagaca tggtcatttg cttcttgagc cttggcttcc aacagtggcc 
541 aaggccaagg atgagcagta ggcagcaggg ggatgagagt cagatggagg gaatcagcat 
601 cttcccttaa gcagatttgg aagatggaga ctgagctttt atccaacttc acaactaga 
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GenBank:X02804.1 



Syrian hamster Cmu-gene for 
immunoglobuliii heavy chain 



• Comment 

• Features 

• Sequence 



LOCUS 
2006 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



X02804 



2641 bp 



DNA 



linear ROD 14-NOV- 



Syrian hamster Cmu-gene for immunoglobulin heavy chain. 
X02804 M23867 
X02804.1 GI:49635 

constant region; Ig heavy chain; immunoglobulin. 
Mesocricetus auratus (golden hamster) 
Mesocricetus auratus 



Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; 
Sciurognathi; Muroidea; Cricetidae; Cricetinae; Mesocricetus. 
REFERENCE 1 (bases 1 to 2641) 

AUTHORS McGuire^K.L. , Duncan, W.R. and Tucker, P. W. 
TITLE Phylogenetic conservation of immunoglobulin heavy chains : 
direct 

comparison of hamster and mouse Cmu genes 
JOURNAL Nucleic Acids Res. 13 (15), 5611-5628 (1985) 
PUBMED 2994005 

COMMENT On Feb 24, 2005 this sequence version replaced gi: 191395 . 

Data kindly reviewed (19-FEB-1986) by P.W. Tucker. 
FEATURES Location/Qualifiers 
source 1..2641 

/organism="Mesocricetus auratus" 
/mol_type~"genomic DNA" 
/db xref="taxon: 10036 " 
gene join(<466. . 778, 882 1220, 1474 . .1791,1906. .2300) 

/gene="Cmu" 

CDS join (<466. .778,882. .1220,1474. .1791,1906. .2300) 

/gene="Cmu" 

/ codon_start=l 

/product^" immunoglobulin heavy chain" 

/protein id=" CAA26574 . 1 " 

/db_xref="GI : 1333850" 

/db_xref ="GOA : P06337 " 

/db_xref =" Inter Pro : IPR003006 " 

/ db_xr e f = " I n t er Pr o : IPR003597 " 

/db xref="InterPro: IPR007110 " 

/db_xref =" InterPro : IPR013783 " 

/db xref="UniProtKB/Swiss-Prot :P06337" 



/translation="SPSSPTVFPLVSCESPLSDENLVAMGCLARDFLPSSISFSWNyQ 



NKSEVNQGVRTFPTLRMGEKYAATSQVFLPPKSVLEGSDEYLVCKVHHGNTNKDLRVP 
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IPWTEMNPNVSVFVPSRDAFSGPAPRKSRLFCEASNFSPKQITVSWLRDGKPVKSGF 

TTEPVTPEDRGSGPRTYKVISTLTITESDWLNLSVYTCRVDHRGLTFWKNVSSTCAAS 

PSTDIQAFPIPPSFVGIFLNKSATLTCLVTNLATYDTLNISWSSRSGEPLETKTKLTE 

SHPNGTFSAIGEANVCVEDWDSGKEFVCTVTHRDLPSPQKKFISKPREMNKTPPAVYQ 

QPLAREQLILRESATVTCLVKGFSPADIFVQWLQRGQPLSQDKYVTSAPMREPQAPHL 

. YFTHSVLTVTEEEWNSGETYTCWGHEALPHMVTERTVDRSTGKPTLYNVSLIMSDAG 

GTCY" 

exon 466. .778 

/gene="Cmu" 

/nuinber=l 
misc feature 598.. 606 

/gene="Cmu" 

/note="pot. glycosylation site" 

intron 779. .881 

/gene="Cmu" 

/nuinber=l 
exon 882.. 1220 

/gene="Cmu" 

/nuinber=2 
misc feature 902. .910 

/gene=" Cmu" 

/note="pot. glycosylation site" 
misc feature 983. .991 

/gene="Cmu" 

/note="pot. glycosylation site" 
misc feature 1142. .1150 
/gene="Cmu" 

/note="pot. glycosylation site" 
misc feature 1196. .1204 
/gene="Cmu" 

/note="pot. glycosylation site" 
intron 1221.. 1473 

/gene="Cmu" 

/ number=2 
exon 1474. .1791 

/gene="Cmu" 

/number=3 
misc feature 1533. .1541 

/gene="Cmu" 

/note="pot. glycosylation site" 
misc feature 1590. .1598 
/ gene =" Cmu" 

/note="pot. glycosylation site" 
misc. feature 1659.. 1667 
/gene="Cmu" 

/note="pot. glycosylation site" 
intron 1792.. 1905 

/gene="Cmu" 

/nuinber=3 
exon 1906.. 2421 

/number=4 
misc feature 1911. .1919 

/gene="Cmu" 

/note="pot. glycosylation site" 
misc feature 2256.. 2264 
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polyA signal 
polyA site 
ORIGIN 

1 cttgagcccc 

61 atgcttccct 

121 agaagcgggg 

181 gctgacgatg 

241 aaggcttaca 

301 tgtaggccag 

361 agaggcgggg 

421 ctcctcctcc 

481 actgtcttcc 

541 ggctgcctgg 

601 aagtctgaag 

661 gcagccacct 

721 ctggtgtgca 

781 aagaaccagc 

841 cttagagcaa 

901 caacgtgagt 

961 cagactcttc 

1021 tgacgggaag 

1081 ttccggaccc 

1141 gaacctgagt 

1201 gtcctccaca 

1261 tcagggccac 

1321 agccttggtt 

1381 taaatcagtt 

1441 acagccactg 

1501 ctatcccccc 

1561 tcacaaacct 

1621 cactggaaac 

1681 gtgaggctaa 

1741 cccacaggga 

1801 tcccatttcc 

1861 ggtccccttc 

1921 cccctgctgt 

1981 cagtcacctg 

2041 gggggcaacc 

2101 ccccacacct 

2161 gggagaccta 

2221 ccgtggacag 

2281 caggtggcac 

2341 tgcactgttt 

2401 aataaaaaaa 

24 61 cgaggcaacg 

2521 acccttcctc 

2581 catccctcca 

2641 a 



/gene="Cmu" 

/note="pot. glycosylation site" 

2401. .2406 

2421 



tcccccacca ccaccacctc 



cagctgactt 
atcctggact 
gcggaagggc 
gtgttctatg 
gttcgcctac 
gggaatgcag 
tccgtttctc 
ccctggtctc 
cacgggactt 
tcaaccaggg 
ctcaggtgtt 
aagtgcacca 
cagtgggggc 
tctgagctca 
gtgttcgttc 
tgcgaagcct 
cctgtgaagt 
cgaacctaca 
gtgtacacct 
tgtgctgcca 
cctcttacaa 
tcaagggcca 
ggtactattt 
acacaatgtt 
ctcctttgtt 
ggcaacctac 
caaaactaaa 
tgtttgtgtg 
tctgccttca 
ttgtcctctg 
tgccaccata 
gtaccagcag 
cctggtgaag 
cttgtcccaa 
ctacttcacc 
cacctgtgtg 
gtccaccggt 
ctgctactga 
acacaaacta 
aagatccact 
gctgtgttgt 
tgtctgcaca 
gctgcttcca 



ctcagaggga 
gtccctgaga 
caggtccact 
gctggaaaag 
caacatatct 
atagcccctg 
ttttatccct 
ctgcgagagc 
cctgcccagc 
tgtcagaacc 
cctgcctccc 
tggcaacacc 
agaggcccag 
caccttgtcc 
catcccggga 
ctaacttcag 
ctggcttcac 
aggtcataag 
gccgtgtgga 
gtgagtgccc 
gtaaggctaa 
agaccagtgt 
ggggatagag 
ttcttgactg 
ggcatcttcc 
gacaccctga 
ctcactgaaa 
gaagactggg 
ccacaaaaga 
gtcagctcca 
tcctcaccac 
ccactagccc 
ggtttctccc 
gacaagtatg 
cacagcgtcc 
gtaggccacg 
aaacccacac 
ccatgctagc 
accatgtcag 
caaaagttgc 
gtttgccttg 
gttgcctcct 
gatacagggc 



ccgatgggct 
agcctagcag 
acaggtgtca 
tagtctagca 
atactccgta 
agccctgtct 
actggtagtg 
ctctggtcct 
cccctgtctg 
tccatttcct 
ttcccaacac 
aagagtgtcc 
aacaaagatc 
gcatggccca 
tttcactcca 
cgccttctct 
tcccaaacaa 
cacagagcca 
cacactcacg 
tcacaggggt 
tgtcctagcc 
tgccaccctg 
cttcccttaa 
acttaagctt 
caggcccatc 
tcaacaagtc 
acatctcctg 
gccaccccaa 
atagcgggaa 
aattcatctc 
gggttcttcc 
tgtctctact 
gagaacaact 
ccgcagacat 
tgaccagcgc 
tgactgtgac 
aggccctgcc 
tgtacaatgt 
ccccaaccag 
agtgagatgt 
tggttttgat 
cacgcaccct 
tctgtgtgaa 
tggaagcgtg 



tctattgttt 
gacagggcta 
gtgccccgcc 
ggcatctgtc 
agcctgcagc 
gccttaagag 
gacaagcaga 
cagagagtcc 
atgagaattt 
tctcctggaa 
tgaggatggg 
tcgagggttc 
tacgcgtgcc 
gaggaggccg 
gtcgtcacag 
ggccctgccc 
atcacagtgt 
gtgactcctg 
atcaccgaaa 
ctcaccttct 
cagtggctat 
acatggccat 
gcagaacctc 
ttctccgact 
cacagacatc 
tgccacgctg 
gtcttcccga 
tggcaccttc 
ggagttcgtg 
aaagcccaga 
tctgtgtcac 
tgcagagatg 
gatcctgagg 
ctttgtgcag 
cccaatgcgg 
agaggaggaa 
acacatggtg 
atcgctgatc 
gcaggctctg 
tgcatcttat 
tttgcaatgc 
gcagatttgc 
ctcacagagt 
gctaccctgg 



tggaatagga 
agcctatttg 
cctggagctg 
cctgtccatg 
aacttcgtcc 
caacagccac 
cagattctta 
gtcctcccca 
ggtggccatg 
ctaccagaac 
agagaagtat 
agatgaatac 
cattccaggt 
ggggctgcag 
agatgaaccc 
cacgaaagtc 
cctggctacg 
aagacagagg 
gcgactggct 
ggaagaatgt 
ccctcccaga 
ttgctccttg 
ggaggttcag 
tcacaacttg 
caagccttcc 
acctgcctgg 
agtggtgagc 
agtgccatag 
tgcaccgtga 
ggtatgcatt 
agggatggca 
aacaagaccc 
gagtcagcca 
tggcttcaga 
gagccccagg 
tggaactccg 
accgagagga 
atgtctgatg 
gggggccagt 
aaaaactaga 
ttttggctgc 
ctctgccctg 
gtaaggatgc 
tgcaggcaga 



